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Description 

This invention relates to the production and manipulation of proteins using recombinant techniques in 
suitable hosts; more specifically to the directed mutagenesis of enzymes in order to modify the 
5 characteristics thereof. 

Various bacteria are known to secrete proteases at some stage in their life cycles. Bacillus species 
produce two major extracellular proteases, a neutral protease (a metalloprotease inhibited by EDTA) and 
an alkaline protease (or subtilisin, a serine endoprotease). Both generally are produced in greatest quantity 
after the exponential growth phase, when the culture enters stationary phase and begins the process of 
w sporuiation. The physiological role of these two proteases is not clear. They have been postulated to play a 
role in sporuiation (J. Hoch, 1976, "Adv. Genet." 75/69—98; P. Piggot etaL, 1976, "Bact. Rev." 40:908—962; 
and F. Priest, 1977, "Bact. Rev." 47:711—753), to be involved in the regulation of cell wall turnover (L. 
Jolliffe eta/., 1980, "J. Bact." 747:1192—1208), and to be scavenger enzymes (Priest, Id.). The regulation of 
expression of the protease genes is complex. They appear to be coordinated regulated in concert with 
15 sporuiation, since mutants blocked in the early stages of sporuiation exhibit reduced levels of both the 
alkaline and neutral protease. Additionally, a number of pleiotropic mutations exist which affect the level of 
expression of proteases and other secreted gene products, such as amylase and levansucrase (Priest, Id.). 

Subtilisin has found considerable utility in industrial and commercial applications (see U.S. Patent No. 
3,623,957 and J. Millet, 1970, "J. Appl. Bact." 33:207}. For example, subtilisins and other proteases are 
20 commonly used in detergents to enable removal of protein-based stains. They also are used in food 
processing to accommodate the proteinaceous substances present in the food preparations to their desired 
impact on the composition. 

Classical mutagenesis of bacteria with agents such as radiation or chemicals has produced a plethora 
of mutant strains exhibiting different properties with respect to the growth phase at which protease 
25 excretion occurs as well as the timing and activity levels of excreted protease. These strains, however, do 
not approach the ultimate potential of the organisms because the mutagenic process is essentially random, 
with tedious selection and screening required to identify organisms which even approach the desired 
characteristics. Further, these mutants are capable of reversion to the parent or wild-type strain. In such 
event the desirable property is lost. The probability of reversion is unknown when dealing with random 
30 mutagenesis since the type and site of mutation is unknown or poorly characterized. This introduces 
considerable uncertainty into the industrial process which is based on the enzyme-synthesizing bacterium. 
Finally, classical mutagenesis frequently couples a desirable phenotype, e.g., low protease levels, with an 
undesirable character such as excessive premature cell lysis. 

Special problems exist with respect to the proteases which are excreted by Bacillus. For one thing, 
35 since at least two such proteases exist, screening for the loss of only one is difficult. Additionally, the large 
number of pleiotropic mutations affecting both sporuiation and protease production make the isolation of 
true protease mutations difficult 

Temperature sensitive mutants of the neutral protease gene have been obtained by conventional 
mutagenic techniques, and were used to map the position of the regulatory and structural gene in the 
40 Bacillus subtilis chromosome (H. Uehara et al., 1979, "J. Bact" 735:583—590). Additionally, a presumed 
nonsense mutation of the alkaline protease gene has been reported (C. Roitsch et al. r 1983, "J. Bact" 
755:145—152). 

Bacillus temperature sensitive mutants have been isolated that produce inactive serine protease or 
greatly reduced levels of serine protease. These mutants, however, are asporogenous and show a 

45 reversion frequency to the wild-type of about from 10~ 7 to 10" 8 (F. Priest, Id. p. 719). These mutants are 
unsatisfactory for the recombinant production of heterologous proteins because asporogenous mutants 
tend to lyse during earlier stages of their growth cycle in minimal medium that do sporogenic mutants, 
thereby prematurely releasing cellular contents (including intracellular proteases) into the culture 
supernatant. The possibility of reversion also is undesirable since wild-type revertants will contaminate the 

so culture supernatant with excreted proteases. 

Bacillus sp. have been proposed for the expression of heterologous proteins, but the presence of 
excreted proteases and the potential resulting hydrolysis of the desired product has retarded the 
commercial acceptance of Bacillus as a host for the expression of heterologous proteins. Bacillus 
megatenum mutants have been disclosed that are capable of sporuiation and which do not express a 

55 sporulation-associated protease during growth phases. However, the assay employed did not exclude the 
presence of other proteases, and the protease in question is expressed during the sporuiation phase (C. 
Loshon etai, 1982, "J. Bact." 750:303—311). This, of course, is the point at which heterologous protein 
would have accumulated in the culture and be vulnerable. The present invention enables the construction 
of a Bacillus strain that is substantially free of extracellular neutral and alkaline protease during all phases 

60 of its growth cycle and which exhibits substantially normal sporuiation characteristics. A need exists for 
non-revertible, otherwise normal protease deficient organisms that can then be transformed with high 
copy number piasmids for the expression of heterologous or homologous proteins. 

Enzymes having characteristics which vary from available stock are required. In particular, enzymes 
having enhanced oxidation stability will be useful in extending the shelf life and bleach compatibility of 

65 
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proteases used in laundry products. Similarly, reduced oxidation stability would be useful in industrial 
processes that require the rapid and efficient quenching of enzymatic activity industrial 
Modifying the pH-activity profiles of an enzyme would be useful in making the enzymes more efficient 
m a wide variety of processes, e.g. broadening the pH-activity profile of a nrota 1Tih „L 

n ra ^ U ! ati ° n f ° f procarv f ticcartJonyl hydrolases (principally proteases but including lipases) will facilitate 
preparation of a variety of different hydrolases, particularly those having other modified I propert es sucJ a ! 

10 su m b s S^ wh H 3 ' rat i° S , UbSt : ate SP6Cificity ' TheSe enz V mes can tnen ^ tailored Xe parScular 

CommurT't/ ra-^rMl^r^^"^- F ° r 566 Svends ^ 1976, "Carlsberg Res. 

uommun. 41 (5). 237—291. These methods, however, suffer from the disadvantages of beinrj dependent 

, 5 upon the presence of convenient amino acid residues, are frequently nonspecific in that they modifv al 

S«5 r f,f ,d " e f W f C ° mm0n Side Chains ' and are not ca P able ° f ™a<Sin9 inacces ible amTno aci! 
residues without further processing, e.g. denaturation, that is generally not completely reve sible n 
remstitutmg activity To the extent that such methods have the objective of replTci^ 

» rupp.rntrch C m a ethods. an0ther ^ *™ " *™ m^enUVromh.^ 

Predetermined, site-directed mutagenesis of tRNA synthetase in which a cys residue is converted to 

1^187 1fl RrTH ePOrted H (G ; Wint6r etal - 1982 ' " NatUre " 233:756-758; A. Wilkinson J f a/! sS^Se" 
fZ: ! 88) - Thls ." leth ° d 13 ™t Poetical for large scale mutagenesis. It is an object herein to provide a 
convenient and rapid method for mutating DNA by saturation mutagenesis P 
25 A method for producing procaryotic carbonyl hydrolase such as subtilisin and neutral protease in 
recombinant host cells is described in which expression vectors containing sequences ^ which encode 
des.red subtilisin or neutral protease, including the pro, pre, or prepro forms of these enzymes are used to 
transform hosts the host cultured and desired enzymes recovered" By mutagenesfs, the coding sequence 

30 describerbeTow ^ ^ Pr ° PertieS ° n the protein that is P™"^ «? KS 

The novel strains then are transformed with at least one DNA moiety encoding a polypeptide not 

te ^ZeTrTn^lT^ TV T^T* CU ' tUred and * e PoiYPeptide^SVom 

In, ^ 1 0rdlnar, 'Y< the DNA molet y '8 a directed mutant of a host Bacillus gene, although it may be DNA 

35 exnre , P 9 H a fr eUCary °h ° Yea f l ° T mammaMan) protein - The novel "trains also serve as hos s for protein 
35 expressed from a bacterial gene derived from sources other than the host genome or for vecto s 

expressing these heterologous genes, or homologous genes from the host genome l the latter event 
SU °^ 3 f a T l3Se arB ( ? btalned free 0f neutral protease or subtili «in. In addition, ft s now posslb^ 

Sac!E .oprS 1 hi 'r tUre WhiCh iS ^ ° f en2Vmatica IIV a «ive subtilisin, and v ce-versa Mutated 
.o Son^^ ° r M pr ° tease f °™ the of copending 

thereof? X^^SS^^^ * ^ ^ ~ 

The present invention provides a convenient method for saturation mutagenesis, thereby enablina the 
<s Xi n 2jSl?fl? 9neral,0n " P,Urality ° f mUtati ° nS 3t anY °" e Site Within the -dinVregion'of a 

(a) obtaining a DNA moiety encoding at least a portion of said protein- 

(b) identifying a region within the moiety; 

r^tri^r' n b f tUtin9 n t ucleotides for u those alr eady existing within the region in order to create at least one 
restriction enzyme s, e unique to the moiety, whereby unique restriction sites 5' and 3' to the identified 
50 reg.on are made ava.lab e such that neither alters the amino acids coded for by the region as expressed 
c JlT eSfZ T 9 \ P U . ralitV ° f ^"•""K'tides. the 5' and 3' ends of which each contain sequences 
capable of annealing to the restriction enzyme sites introduced in step (c) and which, when ligated to the 

^Sorpro X te[nT " sufastitutions ' insertions of at ieast one ami'no acid in or into tS 

55 m M^? Stin9 th ! Tj!* °, f St6P { ? ] With restriction e"2ynes capable of cleaving the unique sites; and 
( f) ligatmg each of the oligonucleotides of step (d) into the digested moiety of step (e) whereby a 
plurality of mutant DNA moieties are obtained. wnereoy a 

~,rho V l . h h ^''esoing method a mutation can be introduced into isolated DNA encoding a procaryotic 
sn ft SnJ^ 6 ^ l UP ° n , ex P ression of the DNA ' results in the substitution, deletion or insertion of 
of 111 Z ?° a °! t 3 P^rmmed site in the hydrolase. This method is useful in creating mutants 
of wild type proteins (where the "precursor" protein is the wild type) or reverting mutants to the wild type 
(where the "precursor" is the mutant). YP 
Mutant enzymes are recovered which exhibit oxidative stability and/or pH-activity profiles which differ 

« iit e p PreCU r° t r T 7 ™ 5 - Procar y° tic carbon V hydrolases having varied Km, Kcat. Kcat/Km ratio and 
65 substrate specificity also are provided herein. 
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The mutant enzymes obtained by the methods herein are combined in known fashion with surfactants 
or detergents to produce novel compositions useful in the laundry or other cleaning arts. 

Brief Description of the Drawings 
5 Figure 1 shows the sequence of a functional B. amyloiiquefaciens subtilisin gene. 

In Figure 1A, the entire functional sequence for & amyloiiquefaciens, including the promoter and 
nbosome binding site, are present on a 1.5 kb fragment of the B. amyloiiquefaciens genome. 

Figure 1B shows the nucleotide sequence of the coding strand, correlated with the amino acid 
sequence of the protein. Promoter (p) ribosome binding site (rbs) and termination (term) regions of the 
w DNA sequence are also shown. 

Figure 2 shows the results of replica nitrocellulose filters of purified positive clones probed with Pool 1 
(Panel A) and Pool 2 (Panel B) respectively. 

Figure 3 shows the restriction analysis of the subtilisin expression plasmid (pS4). pBS42 vector 
sequences (4.5 kb) are shown in solid while the insert sequence (4.4 kb) is shown dashed. 
15 Figure 4 shows the results of SDS-PAGE performed on supernatants from cultures transformed with 
pBS42 and pS4. 

Figure 5 shows the construction of the shuttle vector pBS42. 

Figure 6 shows a restriction map for a sequence including the B. subtilis subtilisin gene. 
Figure 7 is the sequence of a functional B. subtilis subtilisin gene 

20 Figure 8 demonstrates a construction method for obtaining a deletion mutant of a B. subtilis subtilisin 
gene. 

Figure 9 discloses the restriction map for a B. subtilis neutral protease gene. 
Figure 10 is the nucleotide sequence for a B. subtilis neutral protease gene. 

Figure 11 demonstrates the construction of a vector containing a B. subtilis neutrla protease gene. 
25 Figures 12, 13 and 16 disclose embodiments of the mutagenesis technique provided herein. 
Figure 14 shows the enhanced oxidation stability of a subtilisin mutant. 

Figure 1 5 demonstrates a change in the pH-activity profile of a subtilisin mutant when compared to the 
wild type enzyme. 

30 Detailed Description 

Procaryotic carbony! hydrolases are enzymes which hydrolyze compounds containing 

0 

35 C— X bonds 

in which X is oxygen or nitrogen. They principally include hydrolases, e.g. lipases and peptide hydrolases, 
e.g. subtihsins or metalloproteases. Peptide hydrolases include a-aminoacylpeptide hydrolase! 
peptidylamino-acid hydrolase, acylamino hydrolase, serine carboxypeptidase, metallocarboxypeptidase! 

40 thiol proteinase, carboxylproteinase and metalloproteinase. Serine, metallo, thiol and acid proteases are 
included, as well as endo and exo-proteases. 

Subtiltsins are serine proteinases which generally act to cleave internal peptide bonds of proteins or 
peptides. Metalloproteases are exo- or endoproteases which require a metal ion cofactor for activity. 
A number of naturally occurring mutants of subtilisin or neutral protease exist, and all may be 

45 employed with equal effect herein as sources for starting genetic material. 

These enzymes and their genes may be obtained from many procaryotic organisms. Suitable 
examples include gram negative organisms such as E coli or pseudomonas and gram positive bacteria 
such as micrococcus or bacillus. 

The genes encoding the carbonyl hydrolase may be obtained in accord with the general method 

so herein. As will be seen from the examples, this comprises synthesizing labelled probes having putative 
sequences encoding regions of the hydrolase of interest, preparing genomic libraries from organisms 
expressing the hydrolase, and screening the libraries for the gene of interest by hybridization to the probes. 
Positively hybridizing clones are then mapped and sequenced. The cloned genes are ligated into an 
expression vector (which also may be the cloning vector) with requisite regions for replication in the host, 

55 the plasmid transfected into a host for enzyme synthesis and the recombinant host cells cultured under 
conditions favoring enzyme synthesis, usually selection pressure such as is supplied by the.presence of an 
antibiotic, the resistance to which is encoded by the vector. Culture under these conditions results in 
enzyme yields multifold g reater than the wild type enzyme synthesis of the parent organism, even if it is the 
parent organism that is transformed. 

60 "Expression vector" refers to a DNA construct containing a DNA sequence which is operably linked to 
a suitable control sequence capable of effecting the expression of said DNA in a suitable host. Such control 
sequences include a promoter to effect transcription, an optional operator sequence to control such 
transcription, a sequence encoding suitable mRNA ribosome binding sites, and sequences which control 
termination of transcription and translation. The vector may be a plasmid, a phage particle, or simply a 

65 potential genomic insert. Once transformed into a suitable host, the vector may replicate and function 
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independently of the host genome, or may, in some instances, integrate into the genome itself In the 
present specification, "plasmid" and "vector" are sometimes used interchangeably as the plasmid is the 
most commonly used form of vector at present. However, the invention is intended to include such other 
forms of expression vectors which serve equivalent functions and which are, or become, known in the art. 

"Recombinant host cells" refers to cells which have been transformed or transfected with vectors 
constructed using recombinant DNA techniques. As relevant to the present invention, recombinant host 
cells are those which produce procaryotic carbonyl hydrolases in its various forms by virtue of having been 
transformed with expression vectors encoding these proteins. The recombinant host cells may or may not 
have produced a form of carbonyl hydrolase prior to transformation. 

"Operably linked' 7 when describing the relationship between two DNA regions simply means that they 
are functionally related to each other. For example, a presequence is operably linked to a peptide if it 
functions as a signal sequence, participating in the secretion of the mature form of the protein most 
probably involving cleavage of the signal sequence. A promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence; a ribosome binding site is operably linked to a coding sequence 
is if it is positioned so as to permit translation. 

"Prohydrolase" refers to a hydrolase which contains additional N-terminal amino acid residues which 
render the enzyme inactive but, when removed, yield an enzyme. Many proteolytic enzymes are found in 
nature as translational proenzyme products and, in the absence of post-translational products are 
expressed in this fashion. 

20 "Presequence" refers to a signal sequence of amino acids bound to the N-terminal portion of the 
hydrolase which may participate in the secretion of the hydrolase. Presequences also may Be modified in 
the same fashion as is described here, including the introduction of predetermined mutations. When bound 
to a hydrolase, the subject protein becomes a "prehydrolase". Accordingly, relevant prehydrolase for the 
purposes herein are presubtilisin and preprosubtilisin. Prehydrolases are produced by deleting the "pro" 

25 sequence (or at least that portion of the pro sequence that maintains the enzyme in its inactive state) from a 
prepro coding region, and then expressing the prehydrolase. In this way the organism excretes the active 
rather than proenzyme. 

The cloned carbonyl hydrolase is used to transform a host cell in order to express the hydrolase. This 
will be of interest where the hydrolase has commercial use in its unmodified form, as for example subtilisin 

30 rn laundry products as noted above. In the preferred embodiment the hydrolase gene is ligated in a high 
copy number plasmid. This plasmid replicates in hosts in the sense that it contains the well-known 
elements necessary for plasmid replication: a promoter operably linked to the gene in question (which may 
be supplied as the gene's own homologous promotor if it is recognized, i.e., transcribed, by the host), a 
transcription termination and polyadenylation region (necessary for stability of the mRNA transcribed by 

35 the host from the hydrolase gene) which is exogenous or is supplied by the endogenous terminator region 
of the hydrolase gene and, desirably, a selection gene such as an antibiotic resistance gene that enables 
continuous cultural maintenance of plasm id-infected host cells by growth in antibiotic-containing media. 
High copy number plasmids also contain an origin of replication for the host, thereby enabling large 
numbers of plasmids to be generated in the cytoplasm without chromosonal limitations. However it is 

40 within the scope herein to integrate multiple copies of the hydrolase gene into host genome. This is 
facilitated by bacterial strains which are particularly susceptible to homologous recombination. The 
resulting host cells are termed recombinant host cells. 

Once the carbonyl hydrolase gene has been cloned, a number of modifications are undertaken to 
enhance the use of the gene beyond synthesis of the wild type or precursor enzyme. A precursor enzyme is 

45 the enzyme prior to its modification as described in this application. Usually the precursor is the enzyme as 
expressed by the organism which donated the DNA modified in accord herewith. The term "precursor" is 
to be understood as not implying that the product enzyme was the result of manipulation of the precursor 
enzyme per se. 

In the first of these modifications, the gene may be deleted from a recombination positive (reO 
so organism containing a homologous gene. This is accomplished by recombination of an in vitro deletion 
mutation of the cloned gene with the genome of the organism. Many strains of organisms such as £ colt 
and Bacillus are known to be capable of recombination. AJI that is needed is for regions of the residual DNA 
from the deletion mutant to recombine with homologous regions of the candidate host. The deletion may 
be within the coding region (leaving enzymatically inactive polypeptides) or include the entire coding 
55 region as long as homologous flanking regions (such as promoters or termination regions) exist in the 
host. Acceptability of the host for recombination deletion mutants is simply determined by screening for 
the deletion of the transformed phenotype. This is most readily accomplished in the case of carbonyl 
hydrolase by assaying host cultures for loss of the ability to cleave a chromogenic substrate otherwise 
hydrolyzed by the hydrolase. 

so Transformed hosts containing the protease deletion mutants are useful for synthesis of products which 
are incompatible with proteolytic enzymes. These hosts by definition are incapable of excreting the deleted 
proteases described herein, yet are substantially normally sporulating. Also the other growth 
characteristics of the transformants are substantially like the parental organism. Such organisms are useful 
in that it is expected they will exhibit comparatively less inactivation of heterologous proteins than the 

65 parents, and these hosts do have growth characteristics superior to known protease-deficient organisms. 
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However, the deletion of neutral protease and subtilisin as described in this application does not remove all 
of the proteolytic activity or Bacillus. It is believed that intracellular proteases which are not ordinarily 
excreted extracellularly "leak" or diffuse from the cells during late phases of the culture. These intracellular 
proteases may or may not be subtilisin or neutral protease as those enzymes are defined herein 
5 Accordingly, the novel Bacillus strains herein are incapable of excreting the subtilisin and/or neutral 
protease enzymes which ordinarily are excreted extracellularly in the parent strains. "Incapable" means 
not revertible to the wild type. Reversion is a finite probability that exists with the heretofore known 
protease-deficient, naturally occurring strains since there is no assurance that the phenotype of such 
strains is not a function of a readily revertible mutation e.g. a point mutation. This to be contrasted with the 
w extremely large deletions provided herein. 

The deletion mutant-transformed host cells herein are free of genes encoding enzymatically active 
neutral protease or subtilisin, which genes are defined as those being substantially homologous with the 
genes set forth in Figs. 1, 7 or 10. "Homologous" genes contain coding regions capable of hybridizing 
under high stringency conditions with the genes shown in Figs. 1, 7 or 10. 
ts The microbial strains containing carbonyl hydrolase deletion mutants are useful in two principal 
processes. In one embodiment they are advantageous in the fermentative production of products 
ordinarily expressed by a host that are desirably uncontaminated with the protein encoded by the deletion 
gene. An example is fermentative synthesis of amylase, where contaminant proteases interfere in many 
industrial uses for amylase. The novel strains herein relieve the art from part of the burden of purifying 
20 such products free of contaminating carbonyl hydrolases. 

In a second principal embodiment, subtilisin and neutral protease deletion-mutant strains are useful in 
the synthesis of protein which is not otherwise encoded by the strain. These proteins will fail within one of 
two classes. The first class consists of proteins encoded by genes exhibiting no substantia! 
pretransformation homology with those of the host. These may be proteins from other procaryotes but 
25 ordinarily are eucaryotic proteins from yeast or higher eucaryotic organisms, particularly mammals. The 
novel strains herein serve as useful hosts for expressible vectors containing genes encoding such proteins 
because the probability for proteolytic degradation of the expressed, non-homologous proteins is reduced. 

The second group consists of mutant host genes exhibiting substantial pretransformation homology 
with those of the host. These include mutations of procaryotic carbonyl hydrolases such as subtilisin and 
neutral protease, as well as microbial (rennin, for example rennin from the genus Mucor). These mutants 
are selected in order to improve the characteristics of the precursor enzyme for industrial uses. 

A novel method is provided to facilitate the construction and identification of such mutants. First, the 
gene encoding the hydrolase is obtained and sequenced in whole or in part. Then the sequence is scanned 
for a point at which it is desired to make a mutation (deletion, insertion or substitution) of one or more 
amino acids in the expressed enzyme. The sequences flanking this point are evaluated for the presence of 
restriction sites for replacing a short segment of the gene with an oligonucleotide pool which when 
expressed will encode various mutants. Since unique restriction sites are generally not present at locations 
within a convenient distance from the selected point (from TO to 15 nucleotides), such sites are generated 
by substituting nucleotides in the gene in such a fashion that neither the reading frame nor the amino acids 
40 encoded are changed in the final construction. The task of locating suitable flanking regions and evaluating 
the needed changes to arrive at two unique restriction site sequences is made routine by the redundancy of 
the genetic code, a restriction enzyme map of the gene and the large number of different restriction 
enzymes. Note that if a fortuitous flanking unique restriction site is available, the above method need be 
used only in connection with the flanking region which does not contain a site. 
45 Mutation of the gene in order to change its sequence to conform to the desired sequence is 

accomplished by M13 primer extension in accord with generally known methods. Once the gene is cloned, 
it is digested with the unique restriction enzymes and a plurality of end termini-complementary 
oligonucleotide cassettes are ligated into the unique sites. The mutagenesis is enormously simplified by 
this method because all of the oligonucleotides can be synthesized so as to have the same restriction sites, 
so and no synthetic linkers are necessary to create the restriction sites. 

The number of commercially available restriction enzymes having sites not present in the gene of 
interest is generally large. A suitable DNA sequence computer search program simplifies the task of finding 
potential 5' and 3' unique flanking sites. A primary constraint is that any mutation introduced in creation of 
the restriction site must be silent to the final constructed amino acid coding sequence. For a candidate 
55 restriction site 5' to the target codon a sequence must exist in the gene which contains at least all the 
nucleotides but for one in the recognition sequence 5' to the cut of the candidate enzyme. For example, the 
blunt cutting enzyme Smal (CCC/GGG) would be a 5' candidate if a nearby 5' sequence contained NCC, 
CNC, or CCN. Furthermore, if N needed to be altered to C this alteration must leave the amino acid coding 
sequence intact. In cases where a permanent silent mutation is necessary to introduce a restriction site one 
60 may want to avoid the introduction of a rarely used codon. A similar situation for Smal would apply for 3' 
flanking sites except the sequence NGG, GNG, or GGN must exist. The criteria for locating candidate 
enzymes is most relaxed for blunt cutting enzymes and most stringent for 4 base overhang enzymes. In 
general many candidate sites are available. For the codon-222 target described herein a Ball site (TGG/CCA) 
could have been engineered in one base pair 5' from the Kpni site. A 3' EcoRV site (GAT/ATC) could have 
65 been employed 11 base pairs 5' to the Pstl site, A cassette having termini ranging from a blunt end up to a 
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four base-overhang will function without difficulty. In retrospect, this hypothetical EcoRV site would have 
significantly shortened the oligonucleotide cassette employed (9 and 13 base pairs) thus allowing greater 
purity and lower pool bias problems. Flanking sites should obviously be chosen which cannot themselves 
ligate so that ligation of the oligonucleotide cassette can be assured in a single orientation 

5 The mutation perse need not be predetermined. For example, an oligonucleotide cassette or fragment 

is randomly mutagenized with nitrosoguanidine or other mutagen and then in turn ligated into the 
hydrolase gene at a predetermined location. 

The mutant carbonyl hydrolases expressed upon transformation of the suitable hosts are screened for 
enzymes exhibiting desired characteristics, e.g. substrate specificity, oxidation stability, pH-activity profiles 

io and the like. 7 K 

A change in substrate specificity is defined as a difference between the Kcat/Km ratio of the precursor 
enzyme and that of the mutant. The Kcat/Km ratio is a measure of catalytic efficiency. Procaryotic carbonyl 
hydrolases with increased or diminished Kcat/Km ratios are described in the examples Generally the 
objective will be to secure a mutant having a greater (numerically larger) Kcat/Km ratio for a q'iven 
15 substrate, thereby enabling the use of the enzyme to more efficiently act on a target substrate. An increase 
in Kcat/Km ratio for one substrate may be accompanied by a reduction in Kcat/Km ratio for another 
substrate. This is a shift in substrate specificity, and mutants exhibiting such shifts have utility where the 
precursors are undesirable, e.g. to prevent undesired hydrolysis of a particular substrate in an admixture of 
substrates, 

20 Kcat and Km are measured in accord with known procedures, or as described in Example 18 

Oxidation stability is a further objective which is accomplished by mutants described in'the examples 

The stability may be enhanced or diminished as is desired for various uses. Enhanced stability is effected 

by deleting one or more methionine, tryptophan, cysteine or lysine residues and, optionally, substituting 
™ an °^ r . ammo a ? d ^sidue not one of methionine, tryptophan, cysteine or lysine. The opposite 
25 substitutions result in diminished oxidation stability. The substituted residue is preferably alanyl but 

neutral residues also are suitable, 

Mutants are provided which exhibit modified pH-activity profiles. A pH-activity profile is a plot of pH 

against enzyme activity and may be constructed as illustrated in Example 19 or by methods known in the 

art. It may be desired to obtain mutants with broader profiles, i.e., those having greater activity at certain 
30 pH than the precursor, but no significantly greater activity at any pH, or mutants with sharper profiles i e 

those having enhanced activity when compared to the precursor at a given pH, and lesser activity 

elsewhere. 

The foregoing mutants preferably are made within the active site of the enzyme as these mutations are 
most likely to influence activity. However, mutants at other sites important for enzyme stability of 

35 conformation are useful. In the case of Bacillus subtilisin or its pre, prepro and pro forms, mutations at 
tyrosine-1, aspartate+32, asparagine+155, tyrosine+104, methionine+222, glycine+166 histidine+64 
glycine+169, phenylalanine+189, serine+33, serine+221, tyrosine+217, glutamate+156 and/or 
aianme+152 produce mutants having changes in the characteristics described above or in the processing 
of the enzyme. Note that these amino acid position numbers are those assigned to B. amyloliquefaciens 

40 subtilisin as seen from Fig. 7. It should be understood that a deletion or insertion in the N-terminal direction 
from a given position will shift the relative amino acid positions so that a residue will not occupy its original 
or wild type numerical position. Also, allelic differences and the variation among various procaryotic 
species will result in positions shifts, so that position 169 in such subtilisins will not be occupied by glycine. 
In such cases the new positions for glycine will be considered equivalent to and embraced within the 

45 designation glycine+169. The new position for glycine+169 is readily identified by scanning the subtilisin 
in question for a region homologous to glycine+169 in Fig. 7. 

One or more, ordinarily up to about 1 0, amino acid residues may be mutated. However, there is no limit 
to the number of mutations that are to be made aside from commercial practicality. 

The enzymes herein may be obtained as salts. It is clear that the ionization state of a protein will be 

so dependent on the pH of the surrounding medium, if it is in solution, or of the solution from which it is 
prepared, it is in solid form. Acidic proteins are commonly prepared as, for example, the ammonium 
sodium, or potassium salts; basic proteins as the chlorides, sulfates, or phosphates. Accordingly, the 
present application includes both electrically neutral and salt forms of the designated carbonyl hydrolases, 
and the term carbonyl hydrolase refers to the organic structural backbone regardless of ionization state. 

55 The mutants are particularly useful in the food processing and cleaning arts. The carbonyl hydrolases, 
including mutants, are produced by fermentation as described herein and recovered by suitable 
techniques. See for example K. Anstrup, 1974, Industrial Aspects of Biochemistry, ed. B. Spencer pp. 
23—46. They are formulated with detergents or other surfactants in accord with methods known per se for 
use in industrial processes, especially laundry. In the latter case the enzymes are combined with 

60 detergents, builders, bleach and/or fluorescent whitening agents as is known in the art for proteolytic 
enzymes. Suitable detergents include linear alkyl benzene sulfonates, atkyl ethoxylated sulfate, sulfated 
linear alcohol or ethoxylated linear alcohol. The compositions may be formulated in granular or liquid 
form. See for example U.S. Patents, 3,623,957, 4,404,128; 4,381,247; 4,404,115; 4,318,818; 4,261,868" 
4,242,219; 4,142,999; 4,111,855; 4,011,169; 4,090,973; 3,985,686; 3,790,482; 3,749,671; 3,560,392 : 

65 3,558,498; and 3,557,002. 
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The following disclosure is intended to serve as a representation of embodiments herein, and should 
not be construed as limiting the scope of this application. 

Glossary of Experimental Manipulations 
5 In order to simplify the Examples certain frequently occurring methods will be referenced by shorthand 

phrases. 

Plasmids are designated by a small p preceded and/or followed by capital letters and/or numbers. The 
starting plasmids herein are commercially available, are available on an unrestricted basis, or can be 
constructed from such available plasmids in accord with published procedures. 
10 "Klenow treatment" refers to the process of filling a recessed 3' end of double stranded DNA with 
deoxyribonucleotides complementary to the nucleotides making up the protruding 5' end of the DNA 
strand. This process is usually used to fill in a recessed end resulting from a restriction enzyme cleavage of 
DNA. This creates a blunt or flush end, as may be required for further ligations. Treatment with Klenow is 
accomplished by reacting (generally for 15 minutes at 15°C) the appropriate complementry 
is deoxyribonucleotides with the DNA to be filled in under the catalytic activity (usually 10 units) of the 
Klenow fragment of E. coii DNA polymerase I ("Klenow"). Klenow and the other reagents needed are 
commercially available. The procedure has been published extensively. See for example T. Maniatis et aL, 
1932, Molecular Cloning, pp. 107—108. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with an enzyme that acts only at certain 

20 locations in the DNA. Such enzymes are called restriction enzymes, and the sites for whicb each is specific 
is called a restriction site. "Partial" digestion refers to incomplete digestion by a restriction enzyme, i.e., 
conditions are chosen that result in cleavage of some but not ail of the sites for a given restriction 
endonuclease in a DNA substrate. The various restriction enzymes used herein are commercially available 
and their reaction conditions, cofactors and other requirements as established by the enzyme suppliers 

25 were used. Restriction enzymes commonly are designated by abbreviations composed of a capital letter 
followed by other letters and then, generally, a number representing the microorganism from which each 
restriction enzyme originally was obtained. In general, about 1 (jg of plasmid or DNA fragment is used with 
about 1 unit of enzyme in about 20 pi of buffer solution. Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37°C 

30 are ordinarily used, but may vary in accordance with the supplier's instructions. After incubation, protein is 
removed by extraction with phenol and chloroform, and the digested nucleic acid is recovered from the 
aqueous fraction by precipitation with ethanol. Digestion with a restriction enzyme infrequently is followed 
with bacteria! alkaline phosphatase hydrolysis of the terminal 5' phosphates to prevent the two restriction 
cleaved ends of a DNA fragment from "circularizing" or forming a closed loop that would impede insertion 

35 of another DNA fragment at the restriction site. Unless otherwise stated, digestion of plasmids is not 
followed by 5' terminal dephosphorylation. Procedures and reagents for dephosphorylation are 
conventional (T. Maniatis eta/., Id., pp. 133 — 134). 

"Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation of the 
digest on 6 percent polyacrylamide gel electrophoresis, identification of the fragment of interest by 

40 molecular weight (using DNA fragments of known molecular weight as markers), removal of the gel section 
containing the desired fragment, and separation of the gel from DNA. This procedure is known generally. 
For example, see R. Lawn et aL, 1981, "Nucleic Acids Res." 5:6103—6114, and D. Goeddei et a/., (1980) 
"Nucleic Acids Res." 8:4057. , 

"Southern Analysis" is a method by which the presence of DNA sequences in a digest or DNA- 

45 containing composition is confirmed by hybridization to a known, labelled oligonucleotide or DNA 
fragment. For the purposes herein, Southern analysis shall mean separation of digests on 1 percent 
agarose and depurination as described by G. Wahl et a/., 1979, "Proc. Nat Acad. Sci. U.S.A." 
75:3683—3687, transfer to nitrocellulose by the method of E. Southern, 1975, "J. Mol. Biol." 55:503—517, 
and hybridization as described by T. Maniatis et aL, 1978, "Cell" 75:687—701. 

so "Transformation" means introducing DNA into an organism so that the DNA is replicable, either as an 

extrachromosomal element or chromosomal integrant. Unless otherwise stated, the method used herein 
for transformation of E. coll is the CaCI 2 method of Mandel et aL, 1970, "J. Mol. Biol." 55:154, and for 
Bacillus, the method of Anagnostopolous et a/., 1961, "J. Bact." 57:791 — 746. 

"Ligation" refers to the process of forming phosphodiester bonds between two double stranded 

55 nucleic acid fragments (T. Maniatis eta/., Id., p. 146). Unless otherwise stated, ligation was accomplished 
using known buffers and conditions with 10 units of T4 DNA ligase ("ligase") per 0.5 ^g of. approximately 
equimolar amounts of the DNA fragments to be ligated. Plasmids from the transformants were prepared, 
analyzed by restriction mapping and/or sequenced by the method of Messing, et aL, 1981, "Nucleic Acids 
Res." 9:309. 

so "Preparation" of DNA from transformants means isolating plasmid DNA from microbial culture. 
Unless otherwise stated, the alkaline/SDS method of Maniatis et aL, Id. p. 90., was used. 

"O! igonucleotides" are short length single or double stranded polydeoxynucleotides which were 
chemically synthesized by the method of Crea et aL, 1980, "Nucleic Acids Res." 5:2331—2348 (except that 
mesitylene nitrotriazole was used as a condensing agent) and then purified on polyacrylamide gels. 

55 All literature citations are expressly incorporated by reference. 
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Example 1 

Preparation of a Genomic DNA Library from B. amyloliquifaciens and Isolation of its Subtilisin Gene 

The known amino acid sequence of the extracellular B. amyioliquefaciens permits the construction of a 
suitable probe mixture. The sequence of the mature subtilisin is included {along with the additional 
information contributed by the present work) in Figure 1. All codon ambiguity for the sequence of amino 

! C A?ci* a h£^°A 121 iS C0V6red by 3 p0o! of ei 9 ht oligonucleotides of the sequence 

AA( T J AA( T )ATGG A( T )GT. 

^ Chromosomal DNA isolated from B. amyioliquefaciens (ATCC No. 23844) as described by J Marmur 
Mo1 ' B,oL "' 3:2QQ > was Partially digested by Sau 3A, and the fragments size selected and ligated into the 
BamH 1 site of dephosphorlated pBS42. <pBS42 is shuttle vector containing origins of replication effective 
both in £. coh and Bacillus. It is prepared as described in Example 4). The Sau3A fragment containing 
vectors were transformed into £ coli K12 strain 294 (ATCC No. 31446) according to the method of M 
Mandel, et aL, 1970, "J. Mol. Bio." 53: 154 using 80—400 nanograms of library DNA per 250 ul of competent 
cells. 

Cells from the transformation mixture were plated at a density of 1—5 x 1 0 3 transformants per 1 50 mm 
plate containing LB medium + 12.5 pg/ml chloroamphenicol, and grown overnight at 37°C until visible 
colonies appeared. The plates were then replica plated onto BA85 nitrocellulose filters overlaid on LB/ 
chloramphenicol plates. The replica plates were grown 10—12 hours at 37°C and the filters transferred to 
fresh plates containing LB and 150 pg/ml spectinomycin to amplify the plasmid pool. 

After overnight incubation at 37°C, filters were processed essentially as described by Grunstein and 
Hogness, 1975, "Proc. Natl. Acad. Sci. (USA)" 72: 3961. Out of approximately 20,000 successful 
transformants, 25 positive colonies were found. Eight of these positives were streaked to purify individual 
clones. 24 clones from each streak were grown in microliter wells, stamped on to two replica filters and 
P ? 1 J2 d ? scrfbed ab0VG with Gither AACAA(?)ATGGA( T c )GT(pool 1) or AA7AA(?)ATGGA< T c )GT(poo! 2) 
which differ by only one nucleotide. As shown in Figure 2, pool 1 hybridized to a much greater extent to all 
positive clones than did pool 2, suggesting specific hybridization. 

Four out of five miniplasmid preparations (Maniatis et aL, Id.) from positive clones gave identical 
restriction digest patterns when digested with Sau3A or Hindi. The plasmid isolated from one of these four 
identical colonies by the method of Maniatis et aL, Id., had the entire correct gene sequence and was 
designated pS4. The characteristics of this plasmid as determined by restriction analysis are shown in 
Figure 3. 



Example 2 

Expression of the Subtilisin Gene 

35 Bacillus subtilis 1-168 (Catalog No. 1-A1, Bacillus Genetic Stock Center) was transformed with pS4 and a 
single chloramphenicol resistant transformant then grown in minimal medium. After 24 hours, the culture 
was centnfuged and both the supernatant (10—200 pi) and pellet assayed for proteolytic activity by 
measuring the change in absorbance per minute at 412 nm using 1 ml of the chromogenic substrate 
succtnyl-L-ala-ala-pro-phe-p-nitroanilide (0.2 uM) in 0.1 M sodium phosphate (pH 8.0) at 25°C A B subtilis 

40 1-168 culture transformed with pBS42 used as a control showed less than 1/200 of the activity shown by the 
pS4 transformed culture. Greater than 95 percent of the protease activity of the pS4 culture was present in 
the supernatant, and was completely inhibited by treatment with phenylmethylsulfonyl fluoride (PMSF) but 
not by EDTA. 

Aliquots of the supernatants were treated with PMSF and EDTA to inhibit all protease activity and 
45 analyzed by 12 percent SDS-PAGE according to the method of Laemmli, UX, 1970 "Nature", 227- 680 To 
prepare the supernatants, 16 uL of supernatant was treated with 1 mM PMSF, 10 mM EDTfo'r 10 minutes 
and boiled with 4 pL of 5x concentrated SDS sample buffer minus p-mercaptoethanol. The results of 
Coomassie stain on runs using supernatants of cells transformed with pS4, pBS42, and untransformed B 
amylohquefaaens are shown in Figure 4. Lane 3 shows authentic subtilisin from B. amyioliquefaciens 
so Lane 2 which is the supernatant from pBS42 transformed B. subtilis, does not give the 31 000 MW band 
associated with subtilisin which is exhibited by Lane 1 from pS4 transformed hosts. The approximately 
31,000 MW band result for subtilisin is characteristic of the slower mobility shown by the known M.W. 
27,500 subtilisin preparations in general. 

55 Example 3 

Sequencing of the B. amyioliquefaciens Subtilisin Gene 

The entire sequence of an EcoRI-BamHI fragment (wherein the EcoRI site was constructed by 

conversion of the Hindi site) of pS4 was determined by the method of F. Sanger, 1977, "Proc. Natl. Acad. 

Sci (USA)", 74:5463. Referring to the restriction map shown in Figure 3, the BamHI-Pvuli fragment was 
so found to hybridize with pool 1 oligonucleotides by Southern analysis. Data obtained from sequencing of 

this fragment directed the sequencing of the remaining fragments (e.g. Pvull-Hincll and Aval-Aval). The 

results are shown in Figure 1. 

Examination of the sequence confirms the presence of codons for the mature subtilisin corresponding 
to that secreted by the B. amyioliquefaciens. Immediately upstream from this sequence is a series of 107 
65 codons beginning with the GTC start codon at -107. Codon -107 to approximately codon -75 encodes an 
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amino acid sequence whose characteristics correspond to that of known signal sequences (Most such 
signal sequences are 18-30 amino acids in length, have hydrophobic cores, and ?erm nateVn a S maU 
hydrophobe am.no ac.d.) Accordingly, examination of the sequence data would indicate that codons -™7 
to approximately -75 encode the signal sequence; the remaining intervening codons ba^en -75 and -1 
5 presumably encode a prosequence. unsueiween /oana l 

Example 4 

Construction of pBS42 

in c- pB n 4 iif ^ 0rmed tnree "wav ligation of fragments derived from P UB110, P C194, and P BR322 (see 
?! fr ° m PUB11 ° is the a PP™™tely 2600 base pair fragment betwee Sail site 

ai. ,1978 J. Bacteriol. 734. 318 (1978); A. Jalanko, etal., 1981 "Gene", 74: 325. The BamHI site was tested 
with , Klenow. The P BR322 portion is the 1100 base pair fragment between the Pvull arte it 2067 Jndlhe 
Sau3A site at 3223 which contains the £ coli origin of replication: F. Bolivar, et al. 1977 Gene 2 95 J 
s Sutcliffe, 1978, Cold Spring Harbor Symposium 43: I, 77. The P C194 fragment'* then 1200 bfse pat 
fragment between the Hpall site at 973 and the Sau3A site at 2006 which contains the gene fo 

ISiZ™ ^T««n ; eS ' Stance ^Pressible in both £ coli and ft subtilis: S. Erhlich, "Proc. Natl. Acad. Sci. 
(USA) , 74:1680; S. Horynuchi etal., 1982, "J. Bacteriol." 150: 815. 

PBS42 thus contains origins of replication operable both in £ coli and in Bacillus and an expressible 
= gene for chloramphenicol resistance. exprebsioie 

Example 5 

Isolation and Sequencing of the B. subtilis Subtilisin Gene 

ft subtilis 1168 chromosomal DNA was digested with EcoRI and the fragments resolved on qel 

£Zf°/n SIS ; h V/ 19 ' 6 6 kb / ra u 9 ment hybridized to a [a-»P] CTP nick translation-labelled fragment 
obtained from the C-terminus of the subtilisin structural gene in P S4, described above. The 6 kb fragmen 
was electroluted and ligated into P BS42 whch had been digested with EcoRI and treated with bXa 

tl t n H P h h0SPha i aSe - L C °" ATCC 31446 W3S transf ° r ™d with the ligation mixture and transformants 
selected by growth on LB agar containing 12.5 ug chloramphenicol/ml. Plasmid DNA was prepared from a 
pooled suspension of 5,000 transformed colonies. This DNA was transformed into ft subtilis BG84 a 
protease deficient strain, the preparation of which is described in Example 8 below. Colonies which 
cW^ U ' e ,t Pr0 J 6 f e W t? Scree u ned b * P' atin 9 °n LB agar plus 1.5 percent w/w Carnation powdered nonfat 
skim milk and 5 ug chloramphenicol/m! (hereafter termed skim milk selection plates) and observinq for 
zones of clearance evidencing proteolytic activity. 

Plasmid DNA was prepared from protease producing colonies, digested with EcoRI, and examined by 
Southern analysis for the presence of the 6 kb EcoRI insert by hybridization to the "P-labelled C-terminus 
fragment of the subtilisin structural gene form ft amyloliquefaciens. A positive clone was identified and the 
& f m 3S T 3ted P S168 - 1 - ^ subtilis BG84 transformed with P S168.1 excreted serine protease at a 
level 5-fold over that produced in ft. subtilis 1168. Addition of EDTA to the supernatants did not affect the 
nrntUfl '-^ V" addlt ',° n ° f ^ SF <P hen Y'™thyl S ulfonyl f | uoride) t0 the supernatants reduce d 
protease activity to levels undetectable in the assay described in Example 8 for strain BG84 

th. 5 S Ctl °c map ° f the 6 - 5 kb Ec ? RI insert is show " Fig- 6- The subtilisin gene was localized to within 
% .,,£-7 - • • 1 7 1 r g ^} o Y subclonin 3 variou s restriction enzyme digests and testing for expression 
of subtilisin in ft subt,hs BG84. Southern analysis with the labelled fragment from the C-terminus of the B 
amyloliquefaciens subtilisin gene as a probe localized the C-terminus of the ft subtilis gene to within or 
part of the 631 bp Hmcll fragment B in the center of this subclone (see Fig. 6). The tandem Hindi fragments 
B, C, and D and Hincll-EcoRI fragment E (Fig. 6) were ligated into the MN13 vectors mp8 or mp9 and 

(F S U »r^^^ "p °" * 5"?' Wf; I 982 ' " Gene " '*209-276) using dideoxy chain termination 
< F - ^ n ng / r Th 5 ° C - Nat , Acad - S«. U.S.A." 74:5463-5467). The sequence of this region is shown 

7 'J heflrs 23 ammo ac ' ds are believed to be a signal peptide. The remaining 83 amino acids 
between the s.gnal sequence and the mature coding sequence constitute the putative "pro" sequence The 
overlmed I nucleot.des at the 3' end of the gene are believed to be transcription terminator regions. Two 
possible Shine-Dalgarno sequences are underlined upstream from the mature start codon. 

Example 6 

Manufacture of an Inactivating Mutation of the B. subtilis Subtilisin Gene 

A two step ligation, shown in Fig. 8, was required to construct a plasmid carrying a defective gene 
which would integrate into the Bacillus chromosome. In the first step, pS168.1, which contained the 6.5 kb 
insert originally recovered from the ft subtilis genomic library as described in Example 5 above was 
digested with EcoRI, the reaction products treated with Klenow, the DNA digested with Hindi, and the 800 
bp EcoRI-Hincll fragment E (see Fig. 6); that contains, in part, the 5' end of the ft subtilis subtilisin gene 
was recovered This fragment was ligated into pJH101 (pJHIOl is available from J. Hoch (Scripps) and is 
described by F. A. Ferrari etal., 1983, "J. Bact." 734:318-329) that had been digested with Hindi and 
treated with bacterial alkaline phosphatase. The resultant plasmid, pIDVI, contained fragment E in the 
orientation shown m Fig. 8. In the second step, pS168.1 was digested with Hindi and the 700 bp Hindi 
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fragment B, which contains the 3' end of the subtilisin gene, was recovered. pIDVl was digested at its 
unique Hindi site and fragment B ligated to the linearized pfasmid, transformed in E. coli ATCC 31 446 and 
selected on LB plates containing 12.5 \xq chloramphenicol/ml or 20 ug ampicillin/ml. One resulting'plas'mid 
designated p!DV1.4, contained fragment B in the correct orientation with respect to fragment E This 
plasmid p!DV1.4, shown in Fig. 8, is a deletion derivative of the subtilisin gene containing portions of the 5' 
and 3' flanking sequences as well. 

B. subtiiis BG77, a partial protease-deficient mutant (Prt w ") prepared in Example 8 below was 
transformed with plDVI.4. Two classes of chloramphenicol resistant (Cm r ) transformants were obtained 
Seventy-five percent showed the same level of proteases as BG77 (Prt w ") and 25 percent were almost 
completely protease deficient_(Prt ) as observed by relative zones of clearing on plates containing LB agar 
plus skim milk. The Cm r Prt transformants could not be due to a single crossover integration of the 
plasmid at the homologous regions for fragment E or B because, in such a case, the gene would be 
uninterrupted and the phenotype would be PrT' - . In fact, when either of fragments E or B were ligated 
independently into pJH101 and subsequently transformed into B. subtifls BG77, the protease deficient 
phenotype was not observed. The Cm r phenotype of Cm r Prt" plDVl.4 transformants was unstable in that 
Cm s Prt" derivatives could be isolated from Cm r Prt" cultures at a frequency of about 0.1 percent after 10 
generations of growth in minimal medium in the absence of antibiotic selection. One such derivative was 
obtained and designated BG2018. The deletion was transferred into IA84 (a BGSC strain carrying two 
auxotrophic mutations flanking the subtilisin gene) by PBS1 transduction. The derivative orqanism was 
designated BG2019. 

Example 7 

Preparation of a Genomic DNA Library from B. subtiiis and Isolation of its Neutral Protease Gene 

The partial amino acid sequence of a neutral protease of B. subtiiis is disclosed by P Levy et al 1975 
"Proc. Nat. Acad. Sci. USA" 72:4341-4345. A region of the enzyme (Asp Gin Met4 lie Tyr Gly) was selected 
from this published sequence in which the least redundancy existed in the potential codons for the amino 
acids in the region. 24 combinations were necessary to cover all the potential codinq sequences as 
described below. 



GA I CA \ ATG AT I TA C GG 
C A 



i 



Asp Gin Met He Tyr Gly 



Four pools, each containing six alternatives, were prepared as described above in Example 1 The 
pools were labelled by phosphorylization with [y- 32 P] ATP. 

The labelled pool containing sequences conforming closest to a unique sequence in a B subtiiis 
genome was selected by digesting B. subtiiis (1A72, Bacillus Genetic Stock Center) DNA with various 
restriction enzymes, separating the digests on an electrophoresis gel, and hybridizing each of the four 
probe pools to each of the blotted digests under increasingly stringent conditions until a single band was 
seen to hybridize. Increasingly stringent conditions are those which tend to disfavor hybridization eg 
increases in formamide concentration, decreases in salt concentration and increases in temperature. At 
37 C in a solution of 5x Denhardt's, 5x SSC, 50 mM NaP0 4 pH 6.8 and 20 percent formamide, only pool 4 
would hybridize to a blotted digest. These were selected as the proper hybridization conditions to be used 
tor the neutral protease gene and pool 4 was used as the probe. 

A lambda library of B. subtiiis strain BGSC 1-A72 was prepared in conventional fashion by partial 
digestion of the Bacillus genomic DNA by Sau3A, separation of the partial digest by molecular weight on an 
electrophoresis gel, elution of 15-20 kb fragments (R. Lawn er a/., 1981, "Nucleic Acids Res." 
5:6103-6114), and (igatjon of the fragments to BamHI digested charon 30 phage using a Packagene kit 
from Promega Biotec. 

E coli DPSOsupF was used as the host for the phage library, although any known host for Charon 
lambda phage is satisfactory. The E coli host was plated with the library phage and cultured, after which 
plaques were assayed for the presence of the neutral protease gene by transfer to nitrocellulose and 
screening with probe pool 4 (Benton and Davis, 1977, "Science" 196:180—182). Positive plaques were 
purified through two rounds of single plaque purification, and two plaques were chosen for further study, 
designated \NPRG1 and ANPRG2. DNA was prepared from each phage by restriction enzyme hydrolysis 
and separation on electrophoresis gels. The separated fragments were blotted and hybridized to labelled 
pool 4 oligonucleotides. This disclosed that ANPRG1 contained a 2400 bp Hindlll hybridizing fragment but 
no 4300 EcoRI fragment, while ANPRG2 contained a 4300 bp EcoRI fragment, but no 2400 bp Hindlll 
fragment. 

The 2400 bp ANPRG1 fragment was subcloned into the Hindlll site of pJH101 by the following method, 
XNPRG1 was digested by Hindlll, the digest fractionated by electrophoresis and the 2400 bp fragment 
recovered from the gel. The fragment was ligated to alkaline phosphatase-treated Hindlil digested pJH101 
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and the ligation mixture used to transform £ colt ATCC 31446 by the calcium chloride shock method of V 
Hershfield et aL, 1974, "Proc. Nat. Acad. Sci. (U.S.A.)" 75:3455—3459). Transformants were identified by 
selecting colonies capable of growth on plates containing LB medium plus 12.5 ug chloramphenicol/ml. 

Transformant colonies yielded several plasmids. The orientation of the 2400 bp fragment in each 
plasmid was determined by conventional restriction analysis (orientation is the sense reading or 
transcriptional direction of the gene fragment in relation to the reading direction of the expression vector 
into which it is ligated). Two plasmids with opposite orientations were obtained and desianated 
pNPRsubH6 and pNPRsubHI. 

The 4300 bp EcoRI fragment of XNPRG2 was subcloned into pBR325 by the method described above 
for the 2400 bp fragment except that ANPRG2 was digested with EcoRI and the plasmid was alkaline 
phosphatase-treated, EcoRI-digested pBR325. pBR325 is described by F. Bolivar, 1978, "Gene" 4:121—136. 
Two plasmids were identified in which the 4300 bp insert was present in different orientations. These two 
plasmids were designated pNPRsubRI and pNPRsubRlb. 

Example 8 

Characterization of B. subtilis Neutral Protease Gene 

The pNPRsubHI insert was sequentially digested with different restriction endonucleases and blot 
hybridized with labelled pool 4 in order to prepare a restriction map of the insert (for general procedures of 
restriction mapping see T. Mantatis et aL, Id., p. 377). A 430 bp Rsal fragment was the smallest fragment 
that hybridized to probe pool 4. The Rsal fragment was ligated into the Smal site of M13 mp8 (J. Messing et 
al. f 1982, "Gene" 75:269—276 and J. Messing in Methods in Enzymology, 1983, R. Wu et a!., Eds., 
101:20 — 78) and the sequence determined by the chain-terminating dideoxy method (F. Sanger et a!., 1977, 
"Proc. Nat Acad. Sci. U.S.A." 74:5463—5467). Other restriction fragments from the pNPRsubHI insert were 
ligated into appropriate sites in M13 mp8 or M13 mp9 vectors and the sequences determined. As required, 
dlTP was used to reduce compression artifacts (D. Mills et aL 1979, "Proc. Nat. Acad. Sci. (U.S.A.)" 
75:2232—2235). The restriction map for the pNPRsubHI fragment is shown in Fig. 9. The sequences of the 
various fragments from restriction enzyme digests were compared and an open reading frame spanning a 
codon sequence translatable into the amino and carboxyl termini of the protease (P. Levy et aL, Id.) was 
determined. An open reading frame is a DNA sequence commencing at a known point which in reading 
frame (every three nucleotides) does not contain any internal termination codons. The open reading frame 
extended past the amino terminus to the end of the 2400 bp Hindi!! fragment. The 1300 bp Bglll-Hindlll 
fragment was prepared from pNPRsubRlb (which contained the 4300 bp EcoR! fragment of ANPRG2) and 
cloned in M13 mp8. The sequence of this fragment, which contained the portion of the neutral protease 
leader region not encoded by the 2400 bp fragment of pNPRsubHI, was determined for 400 nucleotides 
upstream from the Hindi II site., 

The entire nucleotide sequence as determined for this neutral protease gene, including the putative 
secretory leader and prepro sequence, are shown in Fig. 10. The numbers above the line refer to amino acid 
positions. The underlined nucleotides in Fig. 10 are believed to constitute the ribosome binding (Shine- 
Dalgarno) site, while the overlined nucleotides constitute a potential hairpin structure presumed to be a 
terminator. The first 27 — 28 of the deduced amino acids are believed to be the signal for the neutral 
protease, with a cleavage point at ala-27 or afa-28. The "pro" sequence of a proenzyme structu re extends to 
the amino-terminal amino acid (a!a-222) of the mature, active enzyme. 

A high copy plasmid carrying the entire neutral protease gene was constructed by (Fig. 1 1 ) ligating the 
BGIII fragment of pNPRsubRI, which contains 1900 bp (Fig. 9), with the Pvull-Hindlll fragment of 
pNPRsubHI, which contains 1400 bp. pBS42 (from Example 4) was digested with BamHI and treated with 
bacterial alkaline phosphatase to prevent plasmid rectrcularization. pNPRsubRI was digested with Bglll, 
the 1900 bp fragment was isolated from gel electrophoresis and ligated to the open BamHI sites of pBS42. 
The ligated plasmid was used to transform E co/i ATCC 31446 by the calcium chloride shock method 
(V. Heshfield et aL Id.), and transformed cells selected by growth on plates containing LB medium with 1 2.5 
ug/ml chloramphenicol. A plasmid having the Bgl II fragment in the orientation shown in Fig. 11 was 
isolated from the transformants and designated pNPRsubBI. pNPRsubBI was digested (linearized) with 
EcoRI, repaired to flush ends by Klenow treatment and then digested with Hindlll. The larger fragment from 
the Hindlll digestion (containing the sequence coding for the amino terminal and upstream regions) was 
recovered. 

The carboxyl terminal region of the gene was supplied by a fragment from pNPRsubHI, obtained by 
digestion of pNPRsubHI with Pvuli and Hindlll and recovery of the 1400 bp fragment. The flush end Pvull 
and the Hindlll site of the 1400 bp fragment was ligated, respectively, to the blunted EcoRI and the Hindlll 
site of pNPRsubBI, as shown in Fig. 1 1. This construct was used to transform B. subtilis strain BG84 which 
otherwise excreted no proteolytic activity by the assays described below. Transformants were selected on 
plates containing LB medium plus 1.5 percent carnation powdered nonfat milk and 5 ug/ml 
chloramphenicol. Plasmids from colonies that cleared a large halo were analyzed. Plasmid pNPR10, 
incorporating the structural gene and flanking regions of the neutral protease gene, was determined by 
restriction analysis to have the structure shown in Fig. 11. 

B. subtilis strain BG84 was produced by N-mehyl-N'-nitro-N-nitrosoguanidine (NTG) mutagenesis of 
B. subtilis 1168 according to the general technique of Adelberg et aL, 1965, "Biochem. Biophys. Res. 
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Commun." 75:78&~795. Mutagenized strain 1168 was plated on skim milk plates (without antibiotic) 
Colon.es producing a smaller halo were picked for further analysis. Each colony was T*a«*£rt£'r 
protease production on skim milk plates and amylase production on starch plates. One such isolate which 
was partially protease deficient, amylase positive and capable of spoliation, was designated BG^ r The 
Z rTnn deflC ' enCy H mut ?*° H n ™* desi 9nated prt-77. The prt-77 allele was moved to a spoOA background 
by congression as described below to produce strain BG84, a sporulation deficient strain. 

Table A 

Strain R e i evan t Genotype origin 

1168 trpC2 

JH703 trpC2, pheA12 , s P oOAa677 Trousdale et aK* 

BG16 £urB6, metBS, 1euA8 , lys-21 , hisA , thr-S Pb 1665 
sacA321 



BG77 trpC2, prt-77 NTG x 1168 

25 BG81 metB5 > prt-77 BG16 DNA x BG77 

BG84 s P o0a677 , prt-77 JH703 DNA x BG81 



* "Mol • Gen, Genetics" 173:61 (1979) 



_ Example 9 

Expression of the Neutral Protease Gene 



— assess ^ississssss^^rs^ wi,h °-' <~— 



55 



60 



65 



., . Example 10 

Manufacture of an Inactivating Mutation of the Neutral Protease Gene 

J*? ^ Rsal bound ed regions in the 2400 bp insert of pNPRsubHI, totalling 527 bp can be deleted in 
order to produce an uncomplete structural gene. The translational products of this gene are "enzwSaTcallC 
S A plasmid having this deletion was constructed as follows. pJH101 was cleaved by digeS with 
Hindlll and treated with bacterial alkaline phosphatase. The fragments of the neutral protease f aene tn Z 

i^ciiryiTMT. we H srr i by digestin9 pNPR.«bm^HKr."dTi to .s 

recovering the 1200 bp Hindlll-Rsal and 680 bp Rsal-Hindlll fragments by qel electroDhoresis ThP^P 
fragments were hgated into linearized P JH101 and used to transform £ c^Altt M^a^™^ 
were selected on plates containing LB medium and 20 ug ampicillin/ml. Plasmids were recovere J C the 
n flT t T d bY reStriCti ° n en2Yme ana,VSis 10 iden,if V * Plasmid having ,h > Zc foments 
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Example 1 1 

Replacement of the Neutral Protease Gene with a Deletion Mutant 

Plasmid pNPRsubHIA was transformed into B. subtills strain BG2019 (the subtilisin deleted mutant 
from Example 6) and chromosomal integrants were selected on skim milk plates. Two types of Cm' 
transformants were noted, those with parental levels of proteolysis surrounding the colony, and those with 
almost no zone of proteolysis. Those lacking a zone of proteolysis were picked, restreaked to purify 
individual colonies, and their protease deficient character on skim milk plates confirmed. One of the Cm r , 
proteolysis deficient colonies was chosen for further studies (designated BG2034). Spontaneous Cm" 
revertants of BG2034 were isolated by overnight growth in LB media containing no Cm, plating for 
individual colonies, and replica plating on media with and without Cm. Three Cm' revertants were isolated, 
two of which were protease proficient, one of which was protease deficient (designated BG2036)! 
Hybridization analysis of BG2036 confirmed that the plasmid had been lost from this strain, probably by 
recombination, leaving only the deletion fragments of subtilisin and neutral protease. 

Example 12 

Phenotype of Strains Lacking Functional Subtilisin and Neutral Protease 

The growth, sporulation and expression of proteases was examined in strains lacking a functional gene 
for either the neutral or alkaline protease or both. The expression of proteases was examined by a zone of 
clearing surrounding a colony on a skim milk plate and by measurement of the protease levels in liquid 
culture supernatants {Table B), A strain (BG2035) carrying the subtilisin gene deletion, arid showed a 30 
percent reduction level of protease activity and a normal halo on milk plates. Strain BG2043, carrying the 
deleted neutral protease gene and active subtilisin gene, and constructed by transforming BG16 (Ex. 8) with 
DNA from BG2036 (Example 1 1 ), showed an 80 percent reduction in protease activity and only a small halo 
on the milk plate. Strain BG2054, considered equivalent to BG2036 (Example 11) in that it carried the 
foregoing deletions in both genes, showed no detectable protease activity in this assay and no detectable 
halo on milk plates. The deletion of either or both of the protease genes had no apparent effect on either 
growth or sporulation. Strains carrying these deletions had normal growth rates on both minimal glucose 
and LB media. The strains sporulated at frequencies comparable to the parent strain BG16. Examination of 
morphology of these strains showed no apparent differences from strains without such deletions. 



Table B 

Effect of protease deletions on protease expression and sporulation. 



Genotype 3 Protease activity* * Percent Sporulation 



BG16 


Wild type 


100 


40 


BG2035 


apra684 


70 


20 


BG2043 


nprE^522 


20 


20 


BG2054 


aprA684,nprEA522 


ND 


45 


BG84(pBS42) 


spoOAA677,prt-77 


ND 




BG84(pMPR10") 


spo0Aa677,prt-77 


3000 





a 0nly the loci relevant to the protease phenotype are shown, 

b Protease activity is espressed in arbitrary units, BG16 was assigned a 

level of 100, ND indicates the level of protease was not detectable in 
the assay used. 



Example 13 

Site-specific Saturation Mutagenesis of the B. Amyloliquefaciens Subtilisin Gene at Position 222- 
Preparation of the Gene for Cassette insertion 

pS4-5, a derivative of pS4 made according to Wells ef aL, "Nucleic Acids Res.", 1983, 7 /:791 1—7924 
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was digested with EcoRI and BamHI, and the 1.5 kb EcoRI-BamHI fragment recovered This fraament was 

"jtolT r- P '£? Z f ° r Z a M ;l 3 WhiCh h3d b6en di9eSted with EcoRI and BamHI Sange Tat 980 
J Mol Biol. 143 161-178. Mess.ng eta/., 1981. "Nucleic Acids Research" 9, 304-321 Messina J anH 

Xn^i <198 ^ G6 , ne W - 2 ,?7 276) - The M ' 13 mp9 Ph39e ligations ' desi 9" ated M -^p9 sKt were usea 

f» ^ r JM1 ° 1 Sin9le Strand6d Ph3 9 e DNA was P re P ar * d from a two mL overnight 

culture. An oligonucleotide primer was synthesized having the sequence overnignt 

5'-GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC-3'. 

Sthl^n'h 0 ". 0 ^ 3 10 ? 6 se q. uence ° f the subtili * gene fragment encoding amino acids 21 6-232 except 
!nH £« 1 P ? d °;f f °7T° aC ' dS 222-225 Were del6ted ' and the codons f °' amino acids 220 227 
no! 2 «T p mU 1 , f e c k mtr0d , lJCe 3 , Kpn ' Site 5 ' t0 ,hG met - 222 codon and a Pstl 3' to the mS+222 
™ B ™ e th °' Sub t St,tuted nucl «oti d es are denoted by asterisks, the underlined codonsta Hne 2 
represent the new restriction sites and the scored sequence in line 4 represents the inserted 
oligonucleotides The primer (about 15 uM> was labelled with p»p] by incubation with [X]-ATP 10 uUn 
20 uL react.onHAmersham 5000 Ci/mmol, 10218) and T 4 polynucleotide kinase (10 unitsT followed by non 

inS Th k T [ (10 ° ^ t0 . a "° W C ° mplete P h ° s Ph°ry'ation of the mutagene is primer Th kinase was" 
inactivated by heating the phosphorylation mixture at 68°C for 15 min 

Res « » BIO^m^ K di l ed - t0 M 3 T p9 , S ^ BT as modified from"Norris et a!., 1983, "Nucleic Acids 
Res. 7 7, 5103-51 12 by combining 5 uL of the labelled mutagenesis primer (~3 uM), ~1 ug M-13 mp9 SUBT 
template 1 pL of 1 uM M-13 sequencing primer (17-mer), and 2.5 [iL of buffer (0 3 M Tris'pH^ 40 m M 

cnfi n m ? : 10 mM DTT ' °- 5 mQ/m ' BSA) - The mixture was heated to 68-C for io minutes Tnd 
SS hat? lnu ^itrao"l temperature. To the annealing mixture was added 3.6 uL of 0.25 mM dGTP 
dCTP, d ATP, and dTTP, 1.25 u.L of 10 mM ATP, 1 uL ligase (4 units) and 1 uL Klenow (5 unte The primer 
K H d I 1931 ' 00 reaCti ° n <t0ta ' V °' Ume 25 M,) P r ° c ^ded 2 hours at 14°C. The Klenow and I gase mre 

nc T^^XSSSS 0 20 Th8 B heated reaCti ° n mixtUre Was dl "9 ested ™* BamHI and E^oRI 
and an ahquot of the digest was applied to a 6 percent polyacrylamide gel and radioactive fraqments were 

Stoth'Sr ^T r Mi 0 f 9raphY - THiS Sh ° Wed the [32R] "genesis primer had indeed been incorporated 
into the EcoRI-BamHI fragment containing the now mutated subtilisin gene wrporaiea 

1 m h c e n r 5 mamder of } he digested reaction mixture was diluted to 200 uLwith 10 mM Tris, pH 8 containina 
1 mM EDTA, extracted once with alrl (v:v) phenol/chloroform mixture then once with ch oroform and he 
aqueous phase recovered. 15 uL of 5M ammonium acetate (pH 8) was added along with two vZmes of 
ethanol to precipitate the DNA from the aqeuous phase. The DNA was pelleted by cen rifuTat^n for five 
m a .. m t ,C ^ fU9e t nd th » ! u P«"«»nt was discarded. 30 uL of 70 percent ethanol was added to wash 
the DNA pellet, the wash was discarded and the pellet lyophilized 

PBS42 from example 4 above was digested with BamHI and EcoRI and purified on an acrylamide ael to 
Son buffer The S'^ 9 ° f ^ digeSted vecto ^0 uM ATP and 6 units lig'ase were dissolvedTn 20^ of 
igation buffer. The ligation went overnight at 14'C. The DNA was transformed into £ CO //294 rec* and the 

nZ^f ST,:" 4 mi ,°, fLB medium COntainin 9 12 ' 5 W /ml chloramphenicol. Plasm d DNA was 

showed 30^5 lerlT n a f nd th l96Ste , d T Kpnl ' EC ° R ' 3nd BamHL Anal * sis ° f the ^triction fragment 
showed 30-50 percent of the molecules contained the expected Kpnl site programmed by the 

fTom g Mi3 «Kr; h ? that th f P ' aSmid p0pu,ation not i ncludin 9 "he 9 K P nl site resulted 

SSlUlo o KSTnd &S^H - ePa,r ° f t^^^^enesis site, thus producing a heterogenous 
tt nS * nS?T P,asm,ds ,n some of the transformants. In order to obtain a pure culture of 
tt , ?\ aS nvk ° NA was t t ransformed a second time into £ c 0 // to clone plasmids containing the 
Kpnl site. PrePa ° m 16 SUCh transformants and were found to contain the expected 

r.S^ a n!^J ° f WSre mad8 fr0m ° ne 0f these six transformants (designated pA222) and 

restrict ion analysis confirmed the presence and location of the expected Kpnl and Pstl sites 40 uq of oA222 
were Rested in 300 uL of Kpnl buffer plus 30 uL Kpnl (300 units) for 1.5 h at 37'C The DNA was 

2 P 00 uL H Si? h" TT* T^rV^r ethan °'' and ly0philized - The DNA P°»* was taken uptn 
IZartlXZ Z f ^,' geS i e i W, Ji h 20 ML (50 ° Units) Pstl for 1 - 5 h ^ 37'C. The aqueous phase was 
S wlt Phenol/CHCI, and the DNA precipitated with ethanol. The DNA was dissolved in water and 

w n 7 r yaC x V J a : m i d , e 9 - e ' electr °Phoresis. Following electroelution of the vector band (120 v for 2 h at 
n»Jn-, r I (Mamatis et a/.. Id.)) the DNA was purified by phenol/CHCI 3 extraction, ethanol 
precipitation and ethanol washing. 

ovh Alh t 0Ug h PA222 could be digested to completion ()98 percent) by either Knpl or Pstl separately, 
exhaustive double digestion was .ncomplete (({50 percent). This may have resulted from the fact that 
these sites were so close (10 bp) that digestion by Knpl allowed "breathing" of the DNA in the vicinity of the 

2Lr^' '' e " M ra "5 " e P aratlon or fra V in 9' Sin ce Pstl will only cleave double stranded DNA, strand 
separation could inhibit subsequent Pstl digestion. 

Example 14 

Ligation of Oligonucleotide Cassettes into the Subtilisin Gene 

10 pM of four complemetary oligonucleotide pools (A-D) Table C which were not 5' phosphorylated 
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were annealed in 20 ul ligase buffer by heating for five minutes at 68°C and then cooling for fifteen minutes 
at room temperature. 1 uM of each annealed oligonucleotide pool, ~0.2 ug Kpnl and Pstl-digested pA222 
obtained in Example 13, 0.5 mM ATP, ligase buffer and 6 units T 4 DNA ligase in 20 pL total volume was 
reacted overnight at 14°C to iigate the pooled cassettes in the vector. A large excess of cassettes (~300x 
over the pA222 ends) was used in the ligation to help prevent intramolecular Kpnl-Kpnl ligation. The 
reaction was diluted by adding 25 ul of 10 mM Tris pH 8 containing 1 mM EDTA. The mixture was 
reannealed to avoid possible cassette concatemer formation by heating to 68°C for five minutes and 
cooling for 15 minutes at room temperature. The ligation mixtures from each pool were transformed 
separately into £ coli 294 rec* cells. A small aliquot from each transformation mixture was plated to 
determine the number of independent transform ants. The large number of transformants indicated a high 
probability of multiple mutagenesis. The rest of the transformants ("200—400 transformants) were 
cultured in 4 ml of LB medium plus 12.5 ug chloramphenicol/ml. DNA was prepared from each 
transformation pool (A — D). This DNA was digested with Kpnl, "0.1 [xg was used to retransform £ coli rec* 
and the mixture was plated to isolate individual colonies from each pool. Ligation of the cassettes into the 
gene and bacterial repair upon transformation destroyed the Kpnl and Pstl sites. Thus, only pA222 was cut 
when the transformant DNA was digested with Kpnl. The cut piasmid would not transform £ coli. 
Individual transformants were grown in culture and DNA was prepared from 24 to 26 transformants per 
pool for direct piasmid sequencing. A synthetic oligonucleotide primer having the sequence 



5'-GAGCTTGATGTCATGGC-3' 

was used to prime the dideoxy sequencing reaction. The mutants which were obtained are described in 
Table C below. 

Two codon+222 mutants {i.e., gin and fie) were not found after the screening described. To obtain these 
a single 25mer oligonucleotide was synthesized for each mutant corresponding to the top oligonucleotide 
strand in Figure 12, Each was phosphorylated and annealed to the bottom strand of its respective 
nonphosphorylated oligonucleotide pool (i.e., pool A for gin and pool D for fie). This was ligated into Kpni 
and Pstl digested pA222 and processed as described for the original oligonucleotide pools. The frequency 
of appearance for single mutants obtained this way was 2/8 and 0/7 for gin and ile, respectively. To avoid 
this apparent bias the top strand was phosphorylated and annealed to its unphosphorylated and 
complementary pool. The heterophosphorylated cassette was ligated into cut pA222 and processed as 
before. The frequency of appearance of gin and ile mutants was now 7/7 and 7/7, respectively. 

The data in Table C demonstrate a bias in the frequency of mutants obtained from the pools. This 
probably resulted from unequal representation of oligonucleotides in the pool. This may have been caused 
by unequal coupling of the particular trimers overthe mutagenesis codon in the pool. Such a bias problem 
could be remedied by appropriate adjustment of trimer levels during synthesis to reflect equal reaction. In 
any case, mutants which were not isolated in the primary screen were obtained by synthesizing a single 
strand oligonucleotide representing the desired mutation, phosphorylating both ends, annealing to the 
pool of non-phosphorylated complementary strands and ligating into the cassette site, A biased 
heteroduplex repair observed for the completely unphosphorylated cassette may result from the fact that 
position 222 is closer to the 5' end of the upper strand than it is to the 5' end of the lower strand (see Figure 
12). Because a gap exists at the unphosphorylated 5' ends and the mismatch bubble in the double stranded 
DNA is at position 222, excision repair of the top strand gap would more readily maintain a circularly 
hybridized duplex capable of replication. Consistent with this hypothesis is the fact that the top strand 
could be completely retained by selective 5' phosphorylation. In this case only the bottom strand contained 
a 5' gap which could promote excision repair. This method is useful in directing biased incorporation of 
synthetic oligonucleotide strands when employing mutagenic oligonucleotide cassettes. 



Example 15 

Site-Specific Mutagenesis of the Subtilisin Gene at Position 166 

The procedure of Examples 13—14 was followed in substantial detail, except that the mutagenesis 
primer differed (the 37 mer shown in Fig. 13 was used), the two restriction enzymes were Sad and Xmail! 
rather than Pstl and Kpnl and the resulting constructions differed, as shown in Fig. 13. 

Bacillus strains exerting mutant subtilisins at position 166 were obtained as described below in 
Example 16. The mutant subtilisins exhibiting substitutions of ala, asp, gin, phe, his, lys, asn, arg, and val 
for the wild-type residue were recovered. 

Example 16 

Preparation of Mutant Subtilisin Enzymes 

B. subtilis strain BG2036 obtained by the method of Example 1 1 was transformed by the plasmids of 
Examples 14, 15 or 20 and by pS4 — 5 as a control. Transformants were plated or cultured in shaker flasks 
for 16 to 48 h at 37 C C in LB media plus 12.5 ug/ml chloramphenicol. Mutant enzymatically active subtilisin 
was recovered by dialyzing ceil broth against 0.01 M sodium phosphate buffer, pH 6.2. The dialyzed broth 
was then titrated to pH 6.2 with 1N HCI and loaded on a 2.5 x 2 cm column of CM cellulose (CM-52 
Whatman). After washing with 0.01 M sodim phosphate, pH 6.2, the subtilisins (except mutants at position 
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+222) were eiuted with the same buffer made 0.08N in NaCI. The mutant subtilisins at position +222 were 
each eiuted with 0.1 M sodium phosphate, pH 7.0. The purified mutant and wild type enzymes were then 
used in studies of oxidation stability, Km, Kcat, Kcat/Km ratio, pH optimum, and changes in substrate 
specificity. a 

Table C Oligonucleotide Pool Organization 

and Frequency of Mutants Obtained 



Pool 



B 



D 



Amino Acids 



Codon-222a 



Frequency^ 



asp GAT 

met ATG 

cys TGT 

a rg AGA 

gin GAA 
unexpected mutants^ 

leu CTT 

Pro CCT 

phe TTC 

tyr TAC 

Ms CAC 
unpexpected mutants 

glu GAA 

ala GCT 

thr ACA 

lys AAA 

asn AAC 

unexpected mutants 

giy GGC 

trp TGG 

lie ATC 

ser AGC 

val GTT 

unexpected mutants 



2/25 
3/25 
13/25 
2/25 
0/25 

5/25 

1/25. 

3/25 

6/25 

5/25 

1/25 

9/25 

3/17 
3/17 
1/17 
1/17 
1/17 
8/17 

1/23 
8/23 
0/23 
1/23 
4/23 
9/23 



Codons were chosen based on frequent use in the cloned 
subtil i sin gene sequence (Wei 1 s et al_. , 1983, Id.). 

Frequency was determined from single track analysis by direct 
plasmid sequencing. 

Unexpected mutants generally comprised double mutants with 
changes in codons next to 222 or at the points of ligation. 
These were believed to result from impurities in the 
obigonucleotide pools and/or erroneous repair of the gapped 
ends. 



60 



Example 17 

Mutant Subtilisin Exhibiting Improved Oxidation Stability 

Subtilisins having cysteine and alanine substituted at the 222 position for wild-type methionine 
(Example 16) were assayed for resistance to oxidation by incubating with various concentrations of sodium 
65 hypochlonde (Clorox Bleach). 
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To a total volume of 400 u! of 0.1 M, pH 7, NaP0 4 buffer containing the indicated bleach concentrations' 
(Fig. 14) sufficient enzyme was added to give a final concentration of 0.016 mg/ml of enzyme. The solutions 
were incubated at 25°Cfor 10 min. and assayed for enzyme activity as follows: 120 uJ of either ala+222 or 

e wild type, or 100 ul of the cys+222 incubation mixture was combined with 890 u.l 0.1 M tris buffer at pH 8 6 
and 10 ul of a sAAPFpN {Example 18) substrate solution (20 mg/ml in DMSO). The rate of increase in 
absorbence at 410 nm due to release of p-nitroaniline (Del Mar, E.G., et aL, 1979 "Anal. Biochem." 99 
316—320) was monitored. The results are shown in Fig. 14. The alanine substitution produced considerably 
more stable enzyme than either the wild-type enzyme or a mutant in which a labile cysteine residue was 

jq substitued for methionine. Surprisingly, the alanine substitution did not substantially interfere with enzyme 
activity against the assay substrate, yet conferred relative oxidation stability on the enzyme. The 
serine+222 mutant also exhibited improved oxidation stability. 

Example 18 

Mutant Subtiiisins Exhibiting Modified Kinetics and Substrate Specificity 

Various mutants for glycine+166 were screened for modified Kcat, Km and Kcat/Km ratios. Kinetic 
parameters were obtained by analysis of the progress curves of the reactions. The rate of reaction was 
measured as a function of substrate concentration. Data was analyzed by fitting to the Michaelis-Mention 
equation using the non-linear regression algorithm of Marquardt (Marquardt, D. W. 1963, "J. Soc. Ind 
AppL Math." 11, 431—41). All reactions were conducted at 25°C in 0.1M tris buffer, pH 8.6, containing 
benzoyl-L-Valyl-Glycyl-L-Arginyl-p-nitroanilide (BVGRpN; Vega Biochemicals) at initial concentrations of 
0.0025 M to 0.00026 M (depending on the value of Km for the enzyme of interest — concentrations were 
adjusted in each measurement so as to exceed Km) or succinyl-L-Alanyl-L-A!anyl-L-Prolyl-L-Phenylalanyl- 
p-nitroaniiide (sAAPFpN; Vega Biochemicals) at initial concentrations of 0.0010 M to 0.00028 M (varvina as 
described for BVGRpN). y y 

25 The results obtained in these experiments were as follows: 

Table D 

Substrate 
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sAAPFpN 



BVGRpN 



Enzyme 


Kcat (s-1) 


Km (M) 




Kcat/K 


166 (wild type) 37 


1.4xl0~ 4 


3 


x 10 5 


ala+166 


19 


2.7xl0~ 5 


7 


x 10 5 


asp+166 


3 


5.8xl0" 4 


5 


x 10 3 


gl u+166 


11 


3.4xl0' 4 


3 


x 10 4 


phe+166 


3 


1.4x10" 5 


2 


x 10 5 


hys+166 


15 


l.lxlO' 4 


1 


x 10 5 


lys+166 


15 


3.4xl0" 5 


4 


x 10 5 


aSn+166 


26 


1.4xl(T 4 


2 


x 10 5 


arg+166 


19 


6.2xl0" 5 


3 


x 10 5 


val+166 


1 


1.4xl0" 4 


1 


x 10 4 


Wild Type 


2 


l.lxlO" 3 


2 


x 10 3 


asp+166 


2 


4.1xl(T 5 


5 


x 10 4 


gl u+166 


2 


2.7xl0" 5 


7 


x 10 4 


asn+166 


1 


1.2xl0" 4 


8 


x 10 3 



The Kcat/Km ratio for each of the mutants varied from that of the wild-type enzyme. As a measure of 
catalytic efficiency, these ratios demonstrate that enzymes having much higher activity against a given 
eo substrate can be readily designed and selected by screening in accordance with the invention herein. For 
example, A166 exhibits over 2 times the activity of the wild type on sAAPFpN. 

This data also demonstrates changes in substrate specificity upon mutation of the wild type enzyme. 
For example, the Kcat/Km ratio for the D166 and E166 mutants is higher than the wild type enzyme with the 
BVGpN substrate, but qualitatively opposite results were obtained upon incubation with sAAPFpN. 
65 Accordingly, the D166 and E166 mutants were relatively more specific for BVGRpN than for sAAPFpN. 
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Example 19 

Mutant Subtilisin Exhibiting Modified pH-Activity Profile 

The pH profile of the Cys+222 mutant obtained in Example 1 6 was compared to that of the wild type 

t*%Ti k « °l 6 ° mg/m ' SAAPFpN in LJ DMS0 ' 10 MI of C V S+222 (°' 18 ™9' m! ) or wild type {0.5 mg/ml) and 
980 nl of buffer (for measurements at pH 6.6, 7.0 and 7.6, 0.1M NaP0< buffer; at pH 8.2, 8.6 and 9 2 0 1 M tris 
buffer; and at P H 9.6 and 10.0, 0.1M glycine buffer), after which the initial rate of change in absorbence at 
410 nm per minute was measured at each pH and the data plotted in Fig. 1 5. The Cys+222 mutant exhibits a 
sharper pH optimum than the wild type enzyme. 

Example 20 

Site-Specific Mutagenesis of the Subtilisin Gene at Position 169 

The procedure of Examples 13—14 was followed in substantial detail, except that the mutagenesis 
primer differed (the primer shown in Fig. 16 was used), the two restriction enzymes were Kpnl and EcoRV 
rather than Pstl and Kpnl and the resulting constructions differed, as shown in Fig. 16 

Bacillus strains excreting mutant subtilisins at position 169 were obtained as described below in 
Example 16. The mutant subtilisins exhibiting substitutions of aia and ser for the wild-type residue were 
recovered and assayed for changes in kinetic features. The assay employed SAAPFpN at pH 8.6 in the same 
fashion as set forth in Example 18. The results were as follows: 



Table E 

Enzyme Kcat (s" 1 ) Km (M) 



Kcat/Km 



58 7.5 x 10" 5 8 x 10 5 

ser+169 38 8.5 x TO" 5 4 x 10 5 



Example 21 

Alterations in Specific Activity on a Protein Substrate 



Position 166 mutants from Examples 15 and 16 were assayed for alteration of soecific antivitv nn a 
naturally occurring protein substrate. Because these mutant proteases cou™ ^ 
we as altered specific activity, the substrate should contain sufficient different ^c eaVage Se? 

h 6 1 f ^ dr ° phobic ' so as not to bias the assay toward a protease with one t^pe of specS' 
The substrate should also contain no derivatized residues that result in the maskina t^r^^all 

" refected 6 *!? ^ 35 h *™^»< -ocollogen,to^ 

TfnrSS™* f ? aSefn ' ° ° 2 Ch3I ' nS ' W3S Ch0S6n 35 3 Suitab,e substrate ' ' 

protoVoHs as follows" " ^ * 3 10 ° mM Tn ' S bUffer pH 8 ' 0 ' 10 mM EDTA " The assay 



40 790 pi 50 mM Tris pH 8.2 



45 



50 



55 



60 



65 



100 ul 1 percent casein (Sigma) solution 
' 10 \i\ test enzyme (10— 200 ug). 

This assay mixture was mixed and allowed to incubate at room temoerature for ?n mm,,t« th. 
reaction was terminated upon the addition of 1 00 pi 100 percent ^JZ^T^^Z^i by ncubaSn 
for 1 5 minutes at room temperature. The precipitated protein was pelleted by centrifuqat^n and t he ontS 
T!¥ ° f tl supernatan : was determined spectrophotometry at 280 nm ^ 
reflection of the amount of unprecipitated, i.e., hydrolyzed, casein in the reactton mixture tL fmount of 
casein hydrolysed by each mutant protease was compared to a series of rtanria^V ' n tl 

t the F wMd protease ' and the act[vity is « 

type activity. Enzyme activities were converted to a specific activitv bv riivirfinn tLT J j , 
activUy by the 280 nm absorbence of the enzyme solution Z jZ Z as'sa " " ^ 

All of the mutants which were assayed showed less specific activity on casein than the wild tvoe with 
the exception of Asn+ 66 which was 26 percent more active on casein than the S ty P T lhe mu^ nt 
showing the least spec.fic activity was ile+166 at 0.184 of the wild type activity. 



Claims 



the m e^hod e t°mSn e g neratin9 ' ° f 3< any 0m ^ W ^ the C0<S ^ ^ ° f ' P™tBin. 

(a) obtaining a DNA moiety encoding at least a portion of said protein; 

(b) identifying a region within the moiety; 

(c) substituting nucleotides for those already existing within the region in order to create at least one 
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restriction enzyme site unique to the moiety, whereby unique restriction sites 5' and 3' to the identified 
region are made available such that neither alters the amino acids coded for by the region as expressed; 

(d) synthesizing a plurality of oligonucleotides, the 5' and 3' ends of which each contain sequences 
capable of annealing to the restriction enzyme sites introduced in step (c) and which, when ligated to the 
moiety, are expressed as substitutions, deletions and/or insertions of at least one amino acid in or into the 
precursor protein; 

(e) digesting the moiety of step (c) with restriction enzymes capable of cleaving the unique sites; 

If) ligating each of the oligonucleotides of step (d) into the digested moiety of step (e) whereby a 
plurality of mutant DNA moieties are obtained; and optionally the further steps of 

(g) expressing each of said moieties as a mutant protein in a suitable host; 

(h) recovering the mutant proteins of step (g); and 

(i) screening the step (h) mutant proteins for the desirable characteristic. 

2. The method of claim 1 wherein the restriction enzyme sites are different. 

3. The method of claim 1 or claim 2 wherein the oligonucleotides are less than about 50 bp. 

Patentanspruche 

1. Verfahren zur Herstellung einer Vieizahl von Mutanten an einer beliebigen Stelle innerhalb des 
Kodierungsbereiches von Protein, umfassend: 

(a) Erhalten eines DIMA-Anteils, der wenigstens einen Abschnitt des genannten Proteips kodiert; 

(b) Identifizieren eines Bereiches innerhalb des Anteils; 

(c) Substituieren von Nucleotiden fur die schon innerhalb des Bereiches vorhandenen, urn wenigstens 
eine Restritctionsenzymstelle zu schaffen, die einmalig im Anteil ist, wodurch einmalige Restriktionsstellen 
5' und 3' vom identifizierten Bereich verfugbar gemacht werden, sodaS keine die vom Bereich kodierten 
Aminosauren verandert; 

(d) Synthetisieren einer Vieizahl von Oligonucleotiden, deren 5'- und 3'-Enden jeweils Sequenzen 
enthalten, die fahig sind, mit dem Restriktionsenzymstellen zu verschmelzen, die in Schritt (c) eingefuhrt 
werden und die, wenn sie mit dem Anteil ligiert werden, als Subtitutionen, Loschungen und/oder 
Einfugungen wenigstens einer Aminosaure in dem oder in das Vorlauferprotein exprimiert werden; 

(e) Digerieren des Anterls aus Schritt (c) mit Restriktionsenzymen, die fahig sind, die einmaligen Stellen 
zu spalten; 

(f) Ligieren jedes der Oligonucleotide aus Schritt (d) in den digerierten Anteil aus Schritt (e), wodurch 
eine Vieizahl von mutanten DNA-Anteilen erhalten wird; und wahlweise weiters die Schritte des 

(g) Exprimierens jedes der genannten Anteiie als ein mutantes Protein in einem geigneten Wirt; 

(h) Gewinnen der mutanten Proteine aus Schritt (g); und 

(i) Screenen de mutanten Proteine aus Schritt (h) in bezug auf die gewunschten Eigenschaften. 

2. Verfahren nach Anspruch 1, worin die Restriktionsenzymstellen unterschiedlich sind. 

3. Verfahren nach Anspruch 1 oder 2, worin die Oligonucleotide weniger als etwa 50 Basenpaare 
enthalten. 

Revendications 

1 . Methode pour la production d'un certain nombre de mutations en tout site dans la region de codage 
d'une proteine, la methode consistant a: 

(a) obtenir un fragment d'ADN codant au moins une portion de ladite proteine; 

(b) identifier une region dans le fragment; 

(c) substituer des nucleotides a ceux existant deja dans la region afin de creer au moins un site 
d'enzyme de restriction unique au fragment, permettant ainsi de disposer de sites uniques de restriction 
cotes 5' et 3' a la region identifiee, de maniere qu'aucun n'altere les acides amines que code la region telle 
qu'exprimee; 

<d) synthetiser un certain nombre d'oligonudeotides dont les extremites cotes 5' et 3' contiennent des 
sequences capables de recuire aux sites d'enzyme de restriction introduits a I'etape (c) et qui, lors d'une 
ligature au fragment, s'expriment sous la forme de substitutions, deletions et/ou insertions d'au moins un 
acide amine dans ou a I'interieur de la proteine precurseur; 

(e) digerer le fragment de I'etape (c) par des enzymes de restriction capables de scinder les sites 
uniques; 

(f) lier chacun des oligonucleotides de I'etape (d) dans le fragment dtgere de I'etape (a) pour obtenir 
ainsi un certain nombre de fragments mutants d'ADN; et facultativement les etapes suppiementaires de 

(g) exprimer chacun desdits fragments en tant que proteine mutante chez un hdte approprie; 

(h) recuperer les proteines mutantes de l'6tape (g); et 

(i) examiner les proteines mutantes de I'etape (h) a la recherche de la caracteristique souhaitable. 

2. Methode de la revendication 1 ou les sites d'enzyme de restriction sont differents. 

3. Methode de la revendication 1 ou de la revendication 2 ou les oligonucleotides sont inferieurs a 
environ 50 pb. 



21 



EP 0 247 647 B1 



EcoRl 
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Cla\ PvuW 



Bom HI 



DO ( HI Y///,///>//////;//;;,<v;;/;;;/^ 

P RBS^PRE^-PRO 



MAT 

• 5kb- 



TERM 



B. 



f 



RBS 



-K7 

1 JfpJ'J^^,^- TA T i ATTCC AT^AJ^T^A TACAAT T AATACACA/iAATAATCTGICT Al TGGTTATTf lGCAAATr,AAAAAAAGGAr,Af;r;ATAAAGA n7(; 

-inn PRE 

Arti GW lvs lvs Val Trp He Ser leti Leo Phe Al* Leu Ala Leu Mr Phe Ttir »vt Ala Phe GW Ser Thr Ser 
90 AGA ARC AAA AAA GTA TGG ATC AGT TTG CTf. TTT GCT TTA GCG HA ATf TTT ACG ATG GCG TTC GGC AGP AC* TCC 

-fln _ 7 n PRO 

Ser Ala Gin Ala AU GW Lvs Ser As* Glv Glu lvs Lvs Wr ] le Val G W Phe Lvs G In Thr Met Ser Thr He t 
I 74 TCT GCC CAG GCC GCA GGG AAA TCA AAC GGG GAA AAG AAA TAT ATT GTC GGG TTT AAA CAG ACA ATG AGC ACG ATG 

-SO 

Ser Ala Ala Irs Lvs lys Asp Val He Ser Glu Lvs Gtv GW Lvs Val Gin Lvs Gin Phe Lvs Tvr Vat Asn Ala 
?4° AGC GCC GCT AAG A AG AAA GAT GTC ATT TCT GAA AAA GT*T GGG AAA GTG CAA A AH CAA TTC AAA TAT GTA GAC flC A 

_?n _ ]n 
AW Ser Ala Thr Leu Asn Gin Lvs Ala Val lvs Glu leu Lvs Lvs Asn Pro Val Ala Tvr VaI Glu Glu Asp 
3?4 GCT TCA GCT ACA TTA AAC GAA AAA GCT GTA AAA GAA TTG AAA AAA GAC CCG AGC GTC GCT TAC GTT GAA GAA GAT 

rr* MAT in - 

His Val Ala His Ala Tyr Ala Gin Ser Vil Pro Tyr Glv Val Ser Gin He Lvs All Pro All Leu His Ser Gin 

3 <W CAt GTA GCA CAT GCG TAC GCG CAG TCC GTG CCT TAC GGC GTA TCA CAA ATT AAA GCC CCT GCT CTG CAC TCT CAA 

?n 3n 4n 

Glv Tvr Thr Glv Ser Asn Val lvs Val Ala Val lie Asn Vr Glv 11* Asn Vr Vr His Prn Asn Leu lv<; VaI 
4 74 GGC TAC ACT GGA TCA AAT GTT AAA GTA GCG GTT ATC GAC AGC GGT ATC GAT TCT TCT CAT CCT GAT TTA A AG GTA 

50 Pro Asn fin a S p 

Ala Gly Gly Ala Ser Met Val Pro Ser Glu Thr Asn Pro Phe Gin Asp Asn Asn Ser His Glv Thr His Val Ala 
549 GCA GGC GAA GCC AGC ATG GTT CCT TCT GAA ACA AAT CCT TTC CAA GAC AAC AAT TCT CAC t** ACT r.tr f,TT GCC 

70 Pn <> r Al4 on 

Glv Thr val Ala Ala Leu Asn Asn Vr He &W Val leu Glv Val Ala Pro Vr Ala Vr iei» Tvr A 1 a Val Lvs 
624 GGC ACA GTT GCG GCT CTT AAT AAC TCA ATC GGT GTA TTA GGC GTT GCG CCA AGC GCA TCA CTT TAC GCT nTA AAA 

Asn Al a jnn j j n 

Val Ley GW Ala Asd Glv Ser Glv Gin Tyr Ser Trp He He Asn Glv He Glu Trp Ala It* AU Asn Asn Met 

ft 00 GTT CTC GGT GCT GAC GGT TCC GGC CAA TAC AGC TGG ATC ATT AAC W,A ATC GAf, TGT, f,CG ATC GCA AAC AAT ATf- 

\?n \y\ ]4 * 

Asd Val He Asn *vt Ser Leu Glv Glv Pro Ser Glv Ser Al« Al a leu lv\ Ala Al a VaI a, p Lvs Ala VaI Ala 
774 GAC GTT ATT AAC ATG AGC CTC GGC GGA CCT TCT GGT TCT GCT GCT TTA AAA GCG GrA GTT GAT AAA GCC r,TT GCA 

IS* Ser Thr l*n 

Ser GW Val Val Val Val Ala Ala Ala Glv Asn Glu Gl y Thr Ser Gly Ser Ser Vr Thr Val GW Tvr Pro Glv 
fl4° TCC GGC GTC GTA GTC GTT GCG GCA GCC GGT AAC GAA GGC ACT TCr (U^ AGC TCA AGC ACA GTG GGC TAC CCT f^T 

1 7" 1 fln i 

Lvs Tyr Pro Ser Val He Ala Val Gly Ala Val Asn Vr Vr *srt Gin Am Ala Vr Phe Vr Vr v*l r,i v o r n 
9?4 AAA TAC CCT TCT GTC ATT GCA GTA GGC GCT GTT GAC AGC AGC AAC CAA AGA GCA TCT TTC TCA AG r GTA GGA CCT 

?nn ?|ri 
Glu Leu Aso Val Het Ala Pro GW Val Ser He Gin Ser Thr leu Pro Glv A<n Lvs Tvr Glv Ala Tvr Asn Glv 
90? GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA AGC ACG CTT CCT GGA AAC AAA TAC GGG GCG TA r AAC GGT 

??" 7T» ?an 

Thr Ser eel Ala Vr Pro His Val All Glv Ala Ala Ala Leu lie Lru Ser Lvs His °rn Asn Trp Thr Asn Thr 
1074 ACG TCA ATG GCA TCT CCG CAC GTT GCC GGA GCG GCT GCT TTG ATT CTT TCT AAG CAC CCG AAC TGG ACA A A" ACT 

Gin ?w\ 
Gin Val Arq Ser Ser Leu Glu Asn Thr Thr Thr Lvs Leu Glv Asp S*r Phe Tyr Tyr GW Lvs GW L*u He Asn 
1140 CAA GTC CGC AGC AGT TTA GAA AAC ACC ACT ACA AAA CTT GGT GAT TCT TTC TAC TAT GGA AAA GGG CTG ATC AAT 

Val Gin Ala Al Ala Gin T I tnlVl 

1>?4 GTA CAC. GCC GCA T,CT CAG TAA AAC^TAAAAAACCGGrrTTGGCCCCGf CnC^TTTTTATT ATT TTT CT TCCTCf"f",f4Tr. TT(;i4 Trrr,rT r ■* 

131ft ATA4TCGACGGAT&GC7CCCTCTGAAAATTTTAACGAGAAACGGCGGGTTGACCCGGCTCAr.TC^CGTAArr,r,-r AA r,T C r Tr , £ftA ^- T - Tr atT rr.rrr. 



Ml* r T TCf£rxaTTrrc^TCAGCTCAATG^cr.TAACGnTrr,ftrrx,rf;TTTTcrTr,ATArrrxG&r.4^r,r.rATTrc:ArTrr,r,c- 
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Fig 2. 



Panel A: probe I [ 32 P-AaCaa?ATGGa£gt] 

I 2 4 5 

t A W A \ ( A \t A \ 
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Fig.3. 
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Fig. U. 
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Fig?. 



I 6ATATACCTAAATAGAGATAAAATCATCTCAAAAAAATGGGTCTACTAAAATATTATTCCATCTATTACAATAAATTCACAGAATAGTCT7TTAAGTAAG 

-100 

fM *t Ar 9 Ser Lys Lys Leu Trp Me Ser Leu Leu Phe Alj Leu Thr L*u 
101 TC TACTCTGAAT TTTTTTAAAA6GAGAGGGTAA AGA GTG AGA AGC AAA AAA TTG TGG ATC AGC TTG TTG TTT 6CG TTA ACG TTA 

-90 -80 -70 

He Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gin Ala Al-a Gly Lys Ser Ser Thr Glu Lys Lys Tyr n- v* 1 
185 ATC TTT ACG ATG GCG TTC AGC AAC ATG TCT GCG CAG GCT GCC GGA AAA AGC AGT AC A GAA AAG AAA T AC ATT GTC 

-60 .50 
Gly Phe Lys Gin Thr Met Ser Ala Met Ser Ser Ala Lys Lys Lys Asp Val He Ser Glu Lys Gly Glv Lv* Val 
260 GGA TTT AAA CAG ACA ATG AGT GCC ATG AGT 7CC GCC AAG AAA AAG GAT GTT ATT TCT GAA AAA GGC GGA AAG GTT 



-«0 -30 .20 

Gin Lys Gin Phe Lys Tyr Val Asn Ala Ala Ala Ala Thr Leu Asp Glu Lys Ala Val Lys Glu L*u Lys Lys Asp 

335 CAA AAG CAA TTT AAG TAT GTT AAC GCG GCC GCA GCA ACA TTG GAT GAA AAA GCT GTA AAA GAA TTG AAA AAA GAT 

-10 -1 l 10 

Pro Ser Val Ala Tyr Val Glu Glu Asp His He Ala His Glu Tyr Ala Gin Ser Val Pro Tyr Gly He Ser Gin 

410 CCG AGC GTT GCA TAT 6TG GAA GAA GAT CAT ATT GCA CAT GAA TAT GCG CAA TCT GTT CCT TAT GGC ATT TCT CAA 

?0 30 32 

He Lys Ala Pro Ala Leu His Ser Gin Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val He Asp Ser Gly He 

485 ATT AAA GCG CCG GCT CTT CAC TCT CAA GGC TAC ACA GGC TCT AAC GTA AAA GTA GCT GTT ATC G AC AGC GGA ATT 

<D 50 _ 60 

Asp Ser Ser His Pro Asp Leu Asn Val Arg Gly Gly Ala Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gin As D 

560 GAC TCT TCT CAT CCT GAC TTA AAC GTC AGA GGC GGA GCA AGC TTC GTA CCT TCT GAA ACA AAC CCA TAC CAG G AC 

64 70 80 

Gly Ser Ser His Gly Thr His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu Gly Val Ser 

635 GGC AGT TCT CAC GGT ACG CAT GTA GCC GGT ACG ATT GCC GCT CTT AAT AAC TCA ATC GGT GTT CTG GGC GTT AGC 

9° 100 HO 

Pro Ser Alt Ser Leu Tyr Ala Val Lys Val Leu Asp Ser Thr Gly Ser 61y Gin Tyr Ser Trp He He Asn Gly 

710 CCA AGC GCA TCA TTA TAT GCA GTA AAA GTG CTT GAT TCA ACA GGA AGC GGC CAA TAT AGC TGG ATT ATT AAC GGC 

120 130 

lie Glu Trp Ala He Ser Asn Asn Met Asp Val He Asn Met Ser Leu Gly Gly Pro Thr Gly Ser Thr Ala Leu 

785 ATT GAG TGS GCC ATT TCC AAC AAT ATG GAT GTT ATC AAC ATG AGC CTT GGC GGA CCT ACT GGT TCT ACA GCG CTG 

HO 150 160 

Lys Thr Val Val Asp Lys Ala Val Ser Ser Gly lie Val Val Ala Ala Ala Ala Gly Asn Glu Gly Ser Ser Glv 

860 AAA ACA GTC GTT GAC AAA GCC GTT TCC AGC GGT ATC GTC GTT GCT GCC GCA GCC GGA AAC GAA GGT TCA TCC GGA 

170 180 

Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr Pro Ser Thr He Ala Val Gly Ala Val Asn Ser Ser Asn Gin 

935 AGC ACA AGC ACA GTC S6C TAC CCT GCA AAA TAT CCT TCT ACT ATT GCA GTA GGT GCG GTA AAC AGC AGC AAC CAA 

? 90 200 210 

Pro 



AGC ACA CTT CCT 



Arg Ala Ser Phe Ser Ser Ala Gly Ser Glu Leu Asp Val Met Ala Pro Gly Val Ser He Gin 

1010 AG* 6CT TC* TTC TCC AGC GCA GGT TCT GAG CTT GAT GTG ATG GCT CCT GGC GTG TCC ATC CAA 

220 221 230 

Gly Gly Thr Tyr 61y Ala Tyr Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala Ala Ala Leu He Leu 

1085 GGA GGC ACT TAC GGC GCT TAT AAC GGA ACG TCC ATG GCG ACT CCT CAC GTT GCC GGA GCA GCA GCG TTA ATT CTT 

2*0 250 260 

Ser Lys His Pro Thr Trp Thr Asn Ala Gin Val Arg Asp Arg Leu Glu Ser Thr Ala Thr Tyr Leu Gly Asn Ser 

1160 TCT AAG CAC CCG ACT TGG ACA AAC GCG CAA GTC CGT GAT CGT TTA GAA AGC ACT GCA ACA TAT CTT GGA AAC TCT 

270 

Phe Tyr Tyr Gly Lys Gly Leu lie Asn Val Gin Ala Ala Ala Gin OC 

1235 TTC TAC TAT GGA AAA GGG TTA ATC AAC GTA CAA GCA GCT GCA CAA TAA TAGTAAAAAGAAGCAGGTTCCTCC ATA CCTGCTT C 



1318 TTTTTATTTGTCAGCATCCTGATGTTCCGGCGCATTCTCTTCTTTCTCCGC ATGTTGA ATCCGTTCC ATGATCGACGGATGGCTG CC T CTGAAAATCTTC 
1418 ACAAGCACCGGAGGATCAACCTGCTCAGCCCCGTCACGGCCAAATCCT6AAACGTTTTAACACTGGCTTCTCTGTTCTCTGTC 
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